I am one of the other GSOC students. I would be happy to help my compatriot with testing.
Thank you very much for your kindness.
Mainly, we need help verifying the search results for languages other than English. Currently webhelp has extensive support for English, French, German and CJK languages. As neither David nor I am familiar with these languages, it is a little hard to verify the search output.
We also need to verify that the build process we specified in the doc is precise and easy to follow.
If you can try to build the webhelp and make sure it works with one of *your* DocBook XML files, it would be greatly appreciated.
And if you are familiar with one of the languages listed above and have DocBook files to test with, that would be great.
Any feedback about this is welcome!
Ann, I hope you did great work this summer, and best of luck with your project! :)
David, if you can find some DocBook files that don't contain anything confidential, please send them to the list.
On Thu, Aug 12, 2010 at 10:56 AM, Cramer, David W (David) <firstname.lastname@example.org> wrote:
Kasun knows more about the details of the stemmer, but I can point you to the documentation for the porter stemmer we used:
Currently, English, French, and German are supported.
You are correct that search does not support wildcards, and I don't believe that the algorithm would return results with the same base but different prefixes (e.g. searching for "inhibit" won't show pages with "exhibit"), but I think that's normal for any search engine.
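To illustrate the behaviour described above, here is a minimal sketch in Python. It is an assumption-laden toy, not the webhelp code: `toy_stem` is a hypothetical three-suffix stripper standing in for the Porter stemmer, and `build_index` and `search` are made-up helper names. It only shows why words sharing a stem hit the same pages while words with different prefixes do not.

```python
# Toy suffix stripper for illustration only; NOT the Porter stemmer used by webhelp.
SUFFIXES = ("ing", "ed", "s")


def toy_stem(word):
    """Strip one common English suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_index(pages):
    """Map each stem to the set of page names that contain a word with that stem."""
    index = {}
    for name, text in pages.items():
        for word in text.split():
            index.setdefault(toy_stem(word), set()).add(name)
    return index


def search(index, query):
    """Exact lookup on the stemmed query: no wildcards, no partial matches."""
    return index.get(toy_stem(query), set())


# Hypothetical pages, invented for this example.
pages = {
    "setup.html": "installing the package",
    "museum.html": "exhibit opening hours",
}
index = build_index(pages)

print(search(index, "installed"))  # same stem as "installing"
print(search(index, "inhibit"))    # different stem from "exhibit"
```

Because the lookup is an exact match on the stem, "install", "installing" and "installed" all land on the same entry, while "inhibit" and a truncated query such as "inst" find nothing.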
I'll add "Support wildcards in query string" to the list of future features. I thought I had added it there already but I see now that it's not listed.
From: Robert Fekete [mailto:email@example.com]
Sent: Thursday, August 12, 2010 7:14 AM
Subject: Re: [docbook-apps] Help needed testing CJK search support in webhelp
First of all, thank you both for your work, it looks very promising!
I have a few questions about how search and stemming work:
- Is it possible to add partial matches to the search results? For example, now
if you search for install, installing, or installed, the same results are
returned (correctly), because these words all come from install. But if you
don't type the entire word (say, only 'inst'), there aren't any results.
- Am I right that the search engine does prefix-only matches? (nstall, *nstall,
etc. do not work)
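As a rough illustration of the partial matching asked about above, the following Python sketch scans a sorted list of indexed stems for a typed prefix. This is purely hypothetical: `prefix_search` and the sample `index` are invented for the example and say nothing about how webhelp actually stores or queries its index.

```python
import bisect


def prefix_search(index, prefix):
    """Return the pages of every indexed term that starts with `prefix`."""
    terms = sorted(index)
    # All terms sharing a prefix are adjacent in sorted order,
    # so start at the first term that is >= prefix.
    start = bisect.bisect_left(terms, prefix)
    hits = set()
    for term in terms[start:]:
        if not term.startswith(prefix):
            break
        hits |= index[term]
    return hits


# Hypothetical stem -> pages index, invented for this example.
index = {
    "install": {"setup.html"},
    "instanc": {"api.html"},
    "uninstall": {"remove.html"},
}

print(prefix_search(index, "inst"))    # matches "install" and "instanc"
print(prefix_search(index, "nstall"))  # matches nothing
```

A scan like this would return results for 'inst', but a query such as 'nstall' would still find nothing, since it only ever matches from the start of a term.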