[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]
Subject: [OASIS Issue Tracker] Issue Comment Edited: (OFFICE-2672) PublicComment: Text in OpenFormula - inadequate for international use
[ http://tools.oasis-open.org/issues/browse/OFFICE-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=23098#action_23098 ] Dennis Hamilton edited comment on OFFICE-2672 at 11/8/10 1:14 AM: ------------------------------------------------------------------ Ignoring that XML 1.0 also recommends that certain characters in the list you gave be avoided, what are never permissible are any characters from #x0-#x1F other than #x9, #xA, and #xD. The wording about legal characters in the definition is strange because U+0000 to U+001F are all all legal characters of Unicode. It is clear that [XML1.0] only accept 3 of them as legal though. [DIGGING DEEPER (not essential to this comment): This prohibition of those ASCII-defined characters can't be because of their control functions, because #x7F to #x9F are allowed, and the Unicode 4 specification, which I am consulting, defines all control codes as characters recognized in plaintext, even if Unicode itself gives them no specific function. My Unicode 4 book also points out that there are 66 code points that are non-characters. This includes the U+FFFE and U+FFFF, but also every U+xxFFFE and U+xxFFFF where xx = 01 to 10 (hexadecimal). According to the conformance requirements for Unicode 4 (I con't have the ISO version to compare with), non-characters should not be removed and consumers shoudl ignore/remove them. Not sure there is anything to say about this situation except that there are other codes that may be difficult to produce and consume that [XML1.0] fails to single out. It makes the definition in [XML1.0] difficult to parse, and it may be because Unicode has evolved faster. I can't tell. I know we are up to Unicode 6 now but I haven't dug farther into this matter. ] was (Author: orcmid): Ignoring that XML 1.0 also recommends that certain characters in the list you gave be avoided, what are never permissible are any characters from #x0-#x1F other than #x9, #xA, and #xD. The wording about legal characters in the definition is strange because U+0000 to U+001F are all all legal characters of Unicode. It is clear that they mean to accept only 3 of them as legal though.. > Public Comment: Text in OpenFormula - inadequate for international use > ---------------------------------------------------------------------- > > Key: OFFICE-2672 > URL: http://tools.oasis-open.org/issues/browse/OFFICE-2672 > Project: OASIS Open Document Format for Office Applications (OpenDocument) TC > Issue Type: Bug > Components: OpenFormula > Affects Versions: ODF 1.2 Part 2 CD 2 > Reporter: Robert Weir > Assignee: Andreas Guelzow > Fix For: ODF 1.2 CD 06 > > > Copied from office-comment list > Original author: Alex Brown <alexb@griffinbrown.co.uk> > Original date: 5 May 2010 10:45:54 -0000 > Original URL: http://lists.oasis-open.org/archives/office-comment/201005/msg00002.html -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://tools.oasis-open.org/issues/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]