
xliff message


Subject: RE: [xliff] XLIFF validation on invalid XML character using <cp>


Sorry, I just realized that D800 is not a valid XML character and therefore must be represented as <cp>, which makes your example a valid one.

Regards,
Rodolfo
--
Rodolfo M. Raya <rmraya@maxprograms.com>
Maxprograms http://www.maxprograms.com

-------- Original Message --------
Subject: RE: [xliff] XLIFF validation on invalid XML character using <cp>
From: "Rodolfo M. Raya" <rmraya@maxprograms.com>
Date: Wed, May 26, 2021 2:50 pm
To: "XLIFF TC" <xliff@lists.oasis-open.org>


Hello Yoshito,

At https://www.w3.org/TR/xml/#charsets I found that the valid character ranges for XML are:

Char    ::=    #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]

Are you sure that D800 should be considered valid?
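
The Char production quoted above can be checked mechanically. The following is a minimal Python sketch; the function name `is_xml_char` is illustrative and does not come from any library or from the validation service. It tests a code point against the listed ranges, including the values on either side of the surrogate block:

```python
def is_xml_char(cp: int) -> bool:
    """Return True if cp matches the XML 1.0 Char production.

    Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    """
    return (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


# Probe the boundaries around the surrogate block U+D800..U+DFFF.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000):
    print(f"U+{cp:04X}: {is_xml_char(cp)}")
```

U+D800 lies in the gap between #xD7FF and #xE000, so it does not match the production.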

Regards,
Rodolfo
--
Rodolfo M. Raya <rmraya@maxprograms.com>
Maxprograms http://www.maxprograms.com

-------- Original Message --------
Subject: [xliff] XLIFF validation on invalid XML character using <cp>
From: "Yoshito Umaoka" <yoshito_umaoka@us.ibm.com>
Date: Wed, May 26, 2021 2:30 pm
To: "Rodolfo M. Raya" <rmraya@maxprograms.com>
Cc: <xliff@lists.oasis-open.org>

Hi Rodolfo,

I was playing with the XLIFF validation service today: https://dev.maxprograms.com/Validation/

---
<xliff xmlns="urn:oasis:names:tc:xliff:document:2.0" version="2.0" srcLang="en" trgLang="fr">
<file id="f1">
  <unit id="u1">
    <segment id="s1">
      <source>Test <cp hex="D800"/></source>
    </segment>
  </unit>
</file>
</xliff>
---

The validation service returns

---
File "test-cp.xliff" is not valid XLIFF.

Reason: Valid XML character represented as <cp>

---

However, this content should be valid. The Unicode code points U+D800 to U+DB7F are reserved as High Surrogates, so U+D800 does not represent a valid XML character and <cp hex="D800"/> should be valid. (Of course, <cp hex="D7FF"/> is invalid, because U+D7FF is a valid XML character.) I suspect the validation service does not check the range properly.
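
The expected behaviour can be sketched in Python as follows. This is an illustration only, not the validation service's implementation: the helper names `is_xml_char` and `misused_cp_values` are hypothetical, and the rule applied (a <cp> element is flagged only when its hex value denotes a valid XML character) is an assumption based on the error message quoted above.

```python
import xml.etree.ElementTree as ET

XLIFF_NS = "urn:oasis:names:tc:xliff:document:2.0"

SAMPLE = """<xliff xmlns="urn:oasis:names:tc:xliff:document:2.0" version="2.0" srcLang="en" trgLang="fr">
<file id="f1">
  <unit id="u1">
    <segment id="s1">
      <source>Test <cp hex="D800"/></source>
    </segment>
  </unit>
</file>
</xliff>"""


def is_xml_char(cp):
    # XML 1.0 Char production; surrogates U+D800..U+DFFF are excluded.
    return (cp in (0x9, 0xA, 0xD) or 0x20 <= cp <= 0xD7FF
            or 0xE000 <= cp <= 0xFFFD or 0x10000 <= cp <= 0x10FFFF)


def misused_cp_values(xliff_text):
    """Return the hex values of <cp> elements that should be flagged.

    Assumption: a value is flagged when it encodes a valid XML character
    or cannot be parsed as hexadecimal.
    """
    flagged = []
    root = ET.fromstring(xliff_text)
    for cp in root.iter(f"{{{XLIFF_NS}}}cp"):
        value = cp.get("hex", "")
        try:
            code = int(value, 16)
        except ValueError:
            flagged.append(value)
            continue
        if is_xml_char(code):
            flagged.append(value)
    return flagged


print(misused_cp_values(SAMPLE))                          # D800 case
print(misused_cp_values(SAMPLE.replace("D800", "D7FF")))  # D7FF case
```

Under these assumptions the D800 document yields no flagged values, while the D7FF variant is reported.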

Thanks,
Yoshito
