done.
Good catch. We should also follow the usage convention
for this attribute: The ISO two-letter language code should be lowercase and
the country code should be uppercase ("en-US" not "en-us"). None of the
existing examples follow this convention.
thanx,
doug
----- Original Message -----
Sent: Wednesday, 12 December 2001 10:51
Subject: [ebxml-msg] language codes
I think the
language codes used in some examples are backwards. They should be "en-US" and
not "us-en". See link below. I included the content of the important
section.
2.12 Language Identification
In document processing, it is often useful to identify the natural or
formal language in which the content is written. A special attribute named
xml:lang may be inserted in documents to specify the language
used in the contents and attribute values of any element in an XML document.
In valid documents, this attribute, like any other, must be declared if it is used. The
values of the attribute are language identifiers as defined by [IETF RFC 1766], Tags
for the Identification of Languages, or its successor on the IETF
Standards Track.
Note:
[IETF RFC 1766] tags are
constructed from two-letter language codes as defined by [ISO 639], from two-letter
country codes as defined by [ISO 3166], or from language
identifiers registered with the Internet Assigned Numbers Authority [IANA-LANGCODES]. It is
expected that the successor to [IETF RFC 1766] will introduce
three-letter language codes for languages not presently covered by [ISO
639].
For example:
<p xml:lang="en">The quick brown fox jumps over the lazy dog.</p>
<p xml:lang="en-GB">What colour is it?</p>
<p xml:lang="en-US">What color is it?</p>
<sp who="Faust" desc='leise' xml:lang="de">
<l>Habe nun, ach! Philosophie,</l>
<l>Juristerei, und Medizin</l>
<l>und leider auch Theologie</l>
<l>durchaus studiert mit heißem Bemüh'n.</l>
</sp> |
Cliff Collins
(510)922-5204
|