OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

xliff message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: RE: [xliff] XLIFF validation on invalid XML character using <cp>


>Are you sure that D800 should be considered valid?

No, it's an invalid code point in XML, so XLIFF must use <cp href="" in this case. This is similar to <cp href="" etc.

Use of <cp> for valid XML character is not allowed - for example, <cp href="" in XLIFF is invalid. Your validator successfully returns an error for this.
Use of <cp> for invalid XML character is required - for example, <cp href="" <cp href="" <cp href="" <cp href="" in XLIFF is valid.

-Yoshito




From:        "Rodolfo M. Raya" <rmraya@maxprograms.com>
To:        "XLIFF TC" <xliff@lists.oasis-open.org>
Date:        05/26/2021 01:50 PM
Subject:        [EXTERNAL] RE: [xliff] XLIFF validation on invalid XML character using <cp>
Sent by:        <xliff@lists.oasis-open.org>




Hello Yoshito, At https://www.w3.org/TR/xml/#charsetsI found that the valid character ranges for XML are: Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] ‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍‍ZjQcmQRYFpfptBannerStart
This Message Is From an External Sender

This message came from outside your organization.


ZjQcmQRYFpfptBannerEnd

Hello Yoshito,

At https://www.w3.org/TR/xml/#charsetsI found that the valid character ranges for XML are:

Char    ::=    #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]

Are you sure that D800 should be considered valid?

Regards,
Rodolfo
--
Rodolfo M. Raya
Maxprograms
http://www.maxprograms.com

-------- Original Message --------
Subject: [xliff] XLIFF validation on invalid XML character using <cp>
From: "Yoshito Umaoka" <
yoshito_umaoka@us.ibm.com>
Date: Wed, May 26, 2021 2:30 pm
To: "Rodolfo M. Raya" <
rmraya@maxprograms.com>
Cc: <
xliff@lists.oasis-open.org>

Hi Rodolfo,


I was playing with XLIFF validation service today -
https://dev.maxprograms.com/Validation/

---

<xliff
xmlns="urn:oasis:names:tc:xliff:document:2.0"version="2.0"srcLang="en"trgLang="fr">
<file
id="f1">
 <unit
id="u1">
   <segment
id="s1">
     <source>
Test <cphex="D800"/></source>
   </segment>
 </unit>
</file>
</xliff>

---


The validation service returns


---

File "test-cp.xliff" is not valid XLIFF.

Reason: Valid XML character represented as <cp>

---


However, this content should be valid. Unicode code point U+D800 (to U+DB7F) is reserved as High Surrogates. The code point does not represent a valid XML character, so <cp hex="D800"/> should be valid. (Of course, <cp hex="D7FF"/> is invalid). I guess the validation service does not check the range properly.


Thanks,
Yoshito


--------------------------------------------------------------------- To unsubscribe from this mail list, you must leave the OASIS TC that generates this mail. Follow this link to all your TCs in OASIS at: https://www.oasis-open.org/apps/org/workgroup/portal/my_workgroups.php





[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]