OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

office message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: [OASIS Issue Tracker] Issue Comment Edited: (OFFICE-2672) PublicComment: Text in OpenFormula - inadequate for international use



    [ http://tools.oasis-open.org/issues/browse/OFFICE-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=23098#action_23098 ] 

Dennis Hamilton edited comment on OFFICE-2672 at 11/8/10 1:14 AM:
------------------------------------------------------------------

Ignoring that XML 1.0 also recommends that certain characters in the list you gave be avoided, what are never permissible are any characters from #x0-#x1F other than #x9, #xA, and #xD.

The wording about legal characters in the definition is strange because U+0000 to U+001F are all all legal characters of Unicode.  It is clear that [XML1.0] only accept 3 of them as legal though.  

[DIGGING DEEPER (not essential to this comment):
  This prohibition of those ASCII-defined characters can't be because of their control functions, because #x7F to #x9F are allowed, and the Unicode 4 specification, which I am consulting, defines all control codes as characters recognized in plaintext, even if Unicode itself gives them no specific function.

My Unicode 4 book also points out that there are 66 code points that are non-characters.  This includes the U+FFFE and U+FFFF, but also every U+xxFFFE and U+xxFFFF where xx = 01 to 10 (hexadecimal).  According to the conformance requirements for Unicode 4 (I con't have the ISO version to compare with), non-characters should not be removed and consumers shoudl ignore/remove them.

Not sure there is anything to say about this situation except that there are other codes that may be difficult to produce and consume that [XML1.0] fails to single out.  It makes the definition in [XML1.0] difficult to parse, and it may be because Unicode has evolved faster.  I can't tell.  I know we are up to Unicode 6 now but I haven't dug farther into this matter. ]

      was (Author: orcmid):
    Ignoring that XML 1.0 also recommends that certain characters in the list you gave be avoided, what are never permissible are any characters from #x0-#x1F other than #x9, #xA, and #xD.

The wording about legal characters in the definition is strange because U+0000 to U+001F are all all legal characters of Unicode.  It is clear that they mean to accept only 3 of them as legal though..
  
> Public Comment: Text in OpenFormula - inadequate for international use
> ----------------------------------------------------------------------
>
>                 Key: OFFICE-2672
>                 URL: http://tools.oasis-open.org/issues/browse/OFFICE-2672
>             Project: OASIS Open Document Format for Office Applications (OpenDocument) TC
>          Issue Type: Bug
>          Components: OpenFormula
>    Affects Versions: ODF 1.2 Part 2 CD 2
>            Reporter: Robert Weir 
>            Assignee: Andreas Guelzow 
>             Fix For: ODF 1.2 CD 06
>
>
> Copied from office-comment list
> Original author: Alex Brown <alexb@griffinbrown.co.uk> 
> Original date: 5 May 2010 10:45:54 -0000
> Original URL: http://lists.oasis-open.org/archives/office-comment/201005/msg00002.html

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://tools.oasis-open.org/issues/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]