OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

office-comment message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Re: [office-comment] textEncoding / x-symbol #1 (ODF all versions)


The basic resource for standards-drafters here is Dan Conolley's 
"Character Set considered harmful"
http://www.w3.org/MarkUp/html-spec/charset-harmful.html
 
Some older systems map the Symbol font to a contigous range of 
application-internal characters, because this supposedly made processing 
easier: frequently they would just re-use ASCII. However, that comes 
from the pre-Unicode days where characters were selected by changing 
fonts. Any vestiges of this should be removed from ODF: it was an 
obsolete hack 15 years ago. 

There are, however, legitimate uses for the PUA. It has been common for 
CJK users (well, perhaps not in the PRC!) to add extra characters to 
fonts when they need them. Indeed, CJK applications may even have 
built-in font editors to cope with these. Unicode reduces the need for 
these (but does not fully remove it.) However, it does not look like 
the  range is being used for that purpose here.

By the way, the character range should be U+F000 to U+F0FF, in preferred 
Unicode notation.

Cheers
Rick Jelliffe 


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]