[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]
Subject: Re: [xmlvoc] Def: character-set
hi. > A set of abstract characters with an integer value for each > character, which is used to represent it. An abstract character > corresponds closely to the common-sense concept of a letter, but > also includes punctuation, digits, whitespace, and other special > symbols. Abstract characters are independent of any specific visual > design, which is left for fonts to provide. > > If we want more information we can add this: > > The integer values are usually chosen to allow character sequences > to be efficiently encoded as a sequence of bytes, and to ensure that > the resulting sequences have desirable byte signatures, especially > in order to ensure compatibility with other character sets and > encodings. shouldn't there be a clear distinction between the coded character set and the character encoding scheme? unicode's division into five different categories (http://www.unicode.org/reports/tr17/) may be too much, but i believe the ccs/ces separation is a good thing to do and helps people to understand unicode better. cheers, erik wilde - tel:+41-1-6325132 - fax:+41-1-6321035 mailto:net.dret@dret.net - http://dret.net computer engineering and networks laboratory swiss federal institute of technology (eth) * try not. do, or do not. there is no try. *
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]