OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.


Help: OASIS Mailing Lists Help | MarkMail Help

docbook-apps message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]

Subject: Re: [docbook-apps] Generating separate closing tags in XHTMLwebhelp output

On Sat, 5 Mar 2011 03:01:06 +0530
Kasun Gajasinghe <kasun.gajasinghe@gmail.com> wrote:

> There's some tools out there to parse dirty HTML tags and retrieve
> it's whole content. But lot of good tools don't have a compatible
> license with DocBook. Htmlcleaner looks like a good solution for
> adding the support for indexing/searching *html* files though. So,
> full support for html would come!

tagsoup from John Cowan is my tool of choice for this.
I even have a version which I use as the parser for input to Saxon
which lets me process html as XML, using full xpath.




Dave Pawson

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]