docbook-apps message



Subject: Re: [docbook-apps] onsgmls: "1D6C2" is not a character number in the document character set


I'm on the Syntext Serna mailing list, and something just like this came up 
there as well. It seems this might be a Unicode issue for the parser.

According to the mail and response below, the Serna parser gives the 
same error message on the same characters, the first of which is #x1D6C2. 
These 5-digit Unicode characters were introduced in DocBook 4.4, when the 
entity sets from W3C replaced the original DocBook entity sets. The mail 
below calls them UCS-4, which is the 32-bit encoding form of ISO 10646 
rather than a Unicode version. What matters is that these code points lie 
above #xFFFF, outside the Basic Multilingual Plane, which would explain why 
a parser limited to 16-bit character numbers rejects them.
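
To make the code point range concrete, here is a small Python sketch (not part of either parser). The helper name describe and the constant BMP_MAX are made up for illustration. It compares the 4.4 value of b.alpha with the 4.3 value and reports whether each fits in 16 bits.

```python
import unicodedata

# Highest code point in the Basic Multilingual Plane.
BMP_MAX = 0xFFFF

def describe(code_point):
    """Summarize a code point: Unicode name, BMP membership, UTF-16 length."""
    ch = chr(code_point)
    return {
        "name": unicodedata.name(ch, "<unnamed>"),
        "in_bmp": code_point <= BMP_MAX,
        # Number of 16-bit units needed to encode the character in UTF-16.
        "utf16_units": len(ch.encode("utf-16-be")) // 2,
    }

new_alpha = describe(0x1D6C2)  # value used by DocBook 4.4 and 4.5
old_alpha = describe(0x03B1)   # value used by DocBook 4.3

print(new_alpha)
print(old_alpha)
```

The 4.4 value needs two 16-bit units (a surrogate pair) in UTF-16, while the 4.3 value fits in one.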

The character #x1D6C2 is the first in isogrk4.ent. In 4.4 and 4.5 it is 
declared as:

<!ENTITY b.alpha          "&#x1D6C2;" ><!--MATHEMATICAL BOLD SMALL ALPHA -->

In 4.3, it is declared as:

<!ENTITY b.alpha	"&#x03B1;"> <!--  -->

Falling back to the entities in 4.3 would be one workaround.  Investigating 
the details of the parser and how it handles Unicode might help too.
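
Before swapping entity files, it may help to see which declarations are affected. The following is only a rough Python sketch: the function name find_non_bmp_entities and the regular expression are my own, and the pattern only handles simple declarations with a single hexadecimal character reference, like the two quoted above.

```python
import re

# Matches simple declarations of the form <!ENTITY name "&#xHHHH;">
ENTITY_RE = re.compile(r'<!ENTITY\s+(\S+)\s+"&#x([0-9A-Fa-f]+);"')

def find_non_bmp_entities(ent_text):
    """Return (entity name, code point) pairs whose code point exceeds #xFFFF."""
    hits = []
    for name, hex_value in ENTITY_RE.findall(ent_text):
        code_point = int(hex_value, 16)
        if code_point > 0xFFFF:
            hits.append((name, code_point))
    return hits

# The two declarations of b.alpha quoted in this message.
sample_44 = '<!ENTITY b.alpha          "&#x1D6C2;" ><!--MATHEMATICAL BOLD SMALL ALPHA -->'
sample_43 = '<!ENTITY b.alpha\t"&#x03B1;"> <!--  -->'

for name, cp in find_non_bmp_entities(sample_44):
    print(f"{name}: #x{cp:X} is above #xFFFF")
print(find_non_bmp_entities(sample_43))
```

To check a real file, read its contents (for example the isogrk4.ent path shown in the onsgmls output) and pass the text to the same function.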

Bob Stayton
Sagehill Enterprises
DocBook Consulting
bobs@sagehill.net



From: "Paul Antonov" <apg@syntext.com>
To: "dmitry mukhin" <dmitry.mukhin@softjoys.com>
Cc: <serna-users@syntext.com>
Sent: Monday, August 14, 2006 6:47 AM
Subject: Re: [Serna] evaluation questions


>
> Dmitry,
>
> I think I can make an educated guess about your problem: a few .ent files
> which come with the DocBook 4.5 DTD include UCS-4 character entities, which
> are not supported. You should use corresponding .ent files from the
> earlier DocBook distributions (or, better yet, you can take them from
> dist/plugins/docbook/dbk44/dtd/ent directory).
>
> Regards,
>
> -- Paul
>
> On Mon, 14 Aug 2006, dmitry mukhin wrote:
>
>> Hi,
>>
>> We are evaluating Serna at our company.
>> We have the following questions:
>>
>> 1. We have our own DTD that extends DocBook:
>>
>> "
>> <!ENTITY % DocBookDTD PUBLIC "-//OASIS//DTD DocBook V4.5b1//EN"
>> "http://www.oasis-open.org/docbook/xml/4.5b1/docbookx.dtd">
>> %DocBookDTD;
>> "
>>
>> When we convert the DTD to XSD, your conversion tool "dtd2xs-2.0" fails
>> displaying the following message:
>>
>> "
>> ERROR: failed to parse DTD from c:\our.dtd
>> XML Parser: <URL>:42:321D6C2 is not a character number in the document character set
>> XML Parser: <URL>:43:321D6C3 is not a character number in the document character set
>> XML Parser: <URL>:44:321D6D8 is not a character number in the document character set
>> XML Parser: <URL>:45:321D6AB is not a character number in the document character set
>> XML Parser: <URL>:46:321D6C5 is not a character number in the document character set
>> XML Parser: <URL>:47:321D6C6 is not a character

----- Original Message ----- 
From: "Sam Steingold" <sds@podval.org>
To: <docbook-apps@lists.oasis-open.org>
Sent: Sunday, August 13, 2006 12:45 PM
Subject: [docbook-apps] onsgmls: "1D6C2" is not a character number in the 
document character set


> with dtd v 4.4 I get these errors:
> onsgmls -s -e -g -c/usr/share/sgml/xml.soc clisp.xml
> In entity refentry included from clisp.xml:19:0
> In entity dbcent included from 
> /usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/docbookx.dtd:123:8
> In entity ISOgrk4 included from 
> /usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/dbcentx.mod:360:9
> onsgmls:/usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/ent/isogrk4.ent:42:30:E: 
> "1D6C2" is not a character number in the document character set
> In entity refentry included from clisp.xml:19:0
> In entity dbcent included from 
> /usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/docbookx.dtd:123:8
> In entity ISOgrk4 included from 
> /usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/dbcentx.mod:360:9
> onsgmls:/usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/ent/isogrk4.ent:43:30:E: 
> "1D6C3" is not a character number in the document character set
> In entity refentry included from clisp.xml:19:0
> In entity dbcent included from 
> /usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/docbookx.dtd:123:8
> In entity ISOgrk4 included from 
> /usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/dbcentx.mod:360:9
> onsgmls:/usr/share/sgml/docbook/xml-dtd-4.4-1.0-30/ent/isogrk4.ent:44:30:E: 
> "1D6D8" is not a character number in the document character set
>
> etc
>
> I did not get these errors with 4.3.
> what do they mean?
> how do I avoid them?
> thanks.
>
> -- 
> Sam Steingold (http://www.podval.org/~sds) on Fedora Core release 5 
> (Bordeaux)
> http://iris.org.il http://palestinefacts.org 
> http://israelunderattack.slide.com
> http://ffii.org http://thereligionofpeace.com 
> http://openvotingconsortium.org
> Diplomacy is the art of saying "nice doggy" until you can find a rock.
>
>



