Subject: RE: [xliff] RE: [xliff-inline] Req 1.15 Representation of invalid XML characters
Hi Yves,

Here are a couple of changes in the XML Schema that may help:

1) Create a restriction for the data type:

   <xs:simpleType name="hexValue">
     <xs:restriction base="xs:hexBinary">
       <xs:pattern value="[0000-0010FFFF]"/>
     </xs:restriction>
   </xs:simpleType>

2) Define the element using the restriction:

   <xs:element name="cp"> <!-- Code Point -->
     <xs:complexType mixed="false">
       <xs:attribute name="hex" use="required" type="xlf:hexValue"/>
     </xs:complexType>
   </xs:element>

The above definition is meant to allow all values from 0000 to 0010FFFF, which includes character ranges that are valid in XML. IMHO, characters that are valid in XML should not be allowed in <cp>.

Regards,
Rodolfo
--
Rodolfo M. Raya <rmraya@maxprograms.com>
Maxprograms http://www.maxprograms.com

> -----Original Message-----
> From: Yves Savourel [mailto:ysavourel@enlaso.com]
> Sent: Monday, September 12, 2011 1:58 PM
> To: xliff-inline@lists.oasis-open.org
> Cc: xliff@lists.oasis-open.org
> Subject: [xliff] RE: [xliff-inline] Req 1.15 Representation of invalid XML
> characters
>
> Hi David, Steven, Helena, all
>
> In our discussion about how to represent characters invalid in XML in XLIFF
> we've adopted an element similar to LDML's cp.
>
> In the processing expectation we are trying to decide what the user agent is
> supposed to do when the hex attribute value is invalid (e.g. hex='qwerty').
>
> Christian suggested reaching out to LDML for some ideas, as this may have
> been discussed there already.
> David, Steven, Helena: Any thoughts?
>
> I'm guessing Steven may be more involved with LDML than David or Helena
> (pure speculation from me).
> I'm adding the TC mailing list to the thread, so he can see and post an answer
> if needed. (Joining the SC to be able to post there is the other option.)
>
> Below is an extract of our latest exchange.
> You can see all the emails here:
> http://lists.oasis-open.org/archives/xliff-inline/
> (search for the ones with "1.15 Representation of invalid XML characters" in
> their title)
>
> > Maybe: "If the value of the hex attribute is invalid,
> > the Readers MUST generate an error and MAY terminate
> > the process. This specification does not prescribe how
> > invalid <cp> values are represented in the parsed content."
> >
> > But I still think it would be better to have an expected
> > behavior: it helps interoperability. U+FFFD seems to be
> > applicable for such a case according to
> > http://en.wikipedia.org/wiki/Replacement_character#Replacement_character.
>
> CL> I would be tempted to reach out to someone from LDML
> CL> (or general Unicode) to get guidance.
>
> Any pointer would be welcome,
>
> Cheers,
> -yves
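
To make the two proposals in this thread concrete, here is a minimal Python sketch of how a reader might handle the hex attribute of <cp>. It is only an illustration under stated assumptions, not behavior prescribed by XLIFF or LDML: the function names decode_cp and is_xml10_char are hypothetical, the fallback to U+FFFD is the option floated in the quoted exchange rather than an agreed rule, and rejecting code points that are valid XML characters follows the opinion given above. The range check is done numerically because a pattern facet matches text (a bracketed expression is a character class), so it may not express the 0000 to 0010FFFF range as intended.

```python
import re
from typing import Tuple

REPLACEMENT = "\uFFFD"  # U+FFFD REPLACEMENT CHARACTER, the fallback floated in the thread

# Assumption: accept 1 to 8 hex digits, so both "1A" and "0010FFFF" parse.
_HEX_RE = re.compile(r"[0-9A-Fa-f]{1,8}")


def is_xml10_char(cp: int) -> bool:
    """Return True if cp is allowed by the XML 1.0 Char production."""
    return (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


def decode_cp(hex_value: str) -> Tuple[str, bool]:
    """Hypothetical helper: map a <cp hex="..."> value to (character, ok).

    ok is False when the value is not hexadecimal, lies above U+10FFFF,
    or names a character that could have been written directly in XML.
    In every failing case U+FFFD is returned as the character.
    """
    if not _HEX_RE.fullmatch(hex_value):
        return REPLACEMENT, False  # e.g. hex='qwerty'
    cp = int(hex_value, 16)
    if cp > 0x10FFFF:
        return REPLACEMENT, False  # outside the Unicode code space
    if is_xml10_char(cp):
        return REPLACEMENT, False  # valid in XML, so <cp> is not needed for it
    return chr(cp), True


if __name__ == "__main__":
    for value in ("001A", "qwerty", "0041", "110000"):
        char, ok = decode_cp(value)
        print(value, hex(ord(char)), ok)
```

A caller that wants the "MUST generate an error" behavior from the proposed wording could raise or log whenever ok is False, and decide separately whether to keep the U+FFFD placeholder or stop processing.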