OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

docbook-apps message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Re: [docbook-apps] Re: xml/groff: comments in text=>whitespacestripped?!


> * In message <20030606142741.GA24261@redhat.com>
> * On the subject of "Re: [docbook-apps] Re: xml/groff: comments in text=>whitespace stripped?!"
> * Sent on Fri, 6 Jun 2003 15:27:41 +0100
> * Honorable Tim Waugh <twaugh@redhat.com> writes:
>
> On Fri, Jun 06, 2003 at 09:56:17AM -0400, Sam Steingold wrote:
> 
> > I would argue that this is incorrect.
> > Since newline is syntactically significant in groff,
> > arbitrarily removing it is a bug.
> 
> But then the following would have all spaces preserved:
> 
>      <para>Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.
>        Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem
>        ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum
>        dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum dolor
>        sit amet.</para>
> 
> And that would create:
> 
> Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.
>        Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem
>        ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum
>        dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum dolor
>        sit amet.
> 
> rather than:
> 
> Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.
> Lorem ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem
> ipsum dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum
> dolor sit amet.  Lorem ipsum dolor sit amet.  Lorem ipsum dolor
> sit amet.
> 
> which is actually what you want.

good point.

1. you can either remove just the leading spaces (but not NLs), or

2. do the space normalization __before__ the lower-level processing,
   i.e., _first_ normalize-space on the raw XML text, and only _then_
   call apply-templates.

PS. "Lorem ipsum dolor sit amet" sounds Latin (and is Greek to me).
    What does it mean?

-- 
Sam Steingold (http://www.podval.org/~sds) running RedHat9 GNU/Linux
<http://www.camera.org> <http://www.iris.org.il> <http://www.memri.org/>
<http://www.mideasttruth.com/> <http://www.palestine-central.com/links.html>
Whom computers would destroy, they must first drive mad.


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]