Subject: RE: [docbook-apps] Help needed testing CJK search support in webhelp
Hi Ann-Marie and other interested DocBook users,
I’ve posted some sample content in German, French, and Chinese from the Madriva documentation that Camille suggested:
http://www.thingbag.net/docbook/gsoc2010/sample-zh/content/ch01.html (Chinese content is beneath the headings)
The stemming and tokenization appears to me to be working. The one quirk I notice is in CJK content: if less than the full query string is matched, then search highlighting doesn’t work for those hits. This is a fairly minor annoyance.
Please take a looks and let me know if you see any problems. We still need to demo the case where we don’t have a stemmer but we’re working on a bug related to that situation.
From: Camille Bégnis [mailto:firstname.lastname@example.org]
I am bi-lingual, and would be able to test the German. I do not have large docbook files to provide, and would not have time to produce them before deadline. (Still completing my documentation and cleaning up final bugs)
I read French, but my Chinese is only conversational.
On Thu, Aug 12, 2010 at 1:53 PM, Kasun Gajasinghe <email@example.com> wrote:
Thank you very much for your kindness.
Mainly we are in need of verifying the search results for languages other than English. Currently webhelp has extensive support for English, French, German and CJK languages. As both David and I not familiar with these languages, it is little hard to verify the search output.
And we need to verify that build process we specified in the doc is precise, and easy to follow.
If you can try to build the webhelp and make sure it works perfectly with one of *your* docbook XML file, it is greatly appreciated.
And if you are familiar with one of these languages I stated above, and have docbook files to test them, it would be great.
Any feedback about this is welcome!
Ann, I hope you did a great work for this summer, and best of luck for your project! :)
David, if you can find some docbook files which doesn't have any confidential issues, please send them to the list.
On Thu, Aug 12, 2010 at 10:56 AM, Cramer, David W (David) <firstname.lastname@example.org> wrote: