OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

xmlvoc message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]

Subject: Re: [xmlvoc] Def: character-set


>   A set of abstract characters with an integer value for each
>   character, which is used to represent it. An abstract character
>   corresponds closely to the common-sense concept of a letter, but
>   also includes punctuation, digits, whitespace, and other special
>   symbols. Abstract characters are independent of any specific visual
>   design, which is left for fonts to provide.
> If we want more information we can add this:
>   The integer values are usually chosen to allow character sequences
>   to be efficiently encoded as a sequence of bytes, and to ensure that
>   the resulting sequences have desirable byte signatures, especially
>   in order to ensure compatibility with other character sets and
>   encodings. 

shouldn't there be a clear distinction between the coded character set 
and the character encoding scheme? unicode's division into five 
different categories (http://www.unicode.org/reports/tr17/) may be too 
much, but i believe the ccs/ces separation is a good thing to do and 
helps people to understand unicode better.


erik wilde  -  tel:+41-1-6325132  -  fax:+41-1-6321035
           mailto:net.dret@dret.net  -  http://dret.net
           computer engineering and networks laboratory
           swiss federal institute of technology  (eth)
           * try not. do, or do not. there is no try. *

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]