
Subject: Re: [search-ws] FW: SRU 1.3 / 2.0 and faceted search


Hi Ray:

Agreed – that certainly seems like a plan.

Note that we (Nature Publishing Group) have, as of last week, a working (i.e. live) implementation of faceted search based on SRU 2.0:

<searchRetrieveResponse xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://docs.oasis-open.org/ns/search-ws/sru-2-0-response http://www.loc.gov/standards/sru/oasis/schemas/sru-2-0-response.xsd" xmlns="http://docs.oasis-open.org/ns/search-ws/sru-2-0-response">
....
</searchRetrieveResponse>

This is using the latest SRU 2.0 schema:

<?xml version="1.0" encoding="UTF-8"?>
<!--  
Last updated June 30, 2010.  Ray Denenberg, Library of Congress.  rden@loc.gov
-->
...

Any search on our server will return faceted data, see e.g.

    http://www.nature.com/opensearch/request?query=foo

And below is an example of the actual faceted data we return. (Apologies for the auto-generated namespacing – but Apache Axis seems to be a thing unto itself – we’re not fully in control.)

We had thought about using the <extraRequestData> route with an SRU 1.2 schema, but for various other reasons (especially OpenSearch compatibility) we wanted to be SRU 2.0 compliant and so upgraded the schema. (I’m not sure what the mention of SRU 1.3 here refers to; I can’t recall there being a version 1.3.)

Cheers,

Tony

==
   <facetedResults xmlns:ns5="http://docs.oasis-open.org/ns/search-ws/sru-2-0-response">
    <ns6:facet xmlns="http://docs.oasis-open.org/ns/search-ws/sru-facetedResults" xmlns:ns6="http://docs.oasis-open.org/ns/search-ws/sru-facetedResults">
     <ns6:facetDisplayLabel>meta:pcode</ns6:facetDisplayLabel>
     <ns6:terms>
      <ns6:term>
       <ns6:actualTerm>nature</ns6:actualTerm>
       <ns6:count>112</ns6:count>
      </ns6:term>
      <ns6:term>
       <ns6:actualTerm>bjc</ns6:actualTerm>
       <ns6:count>49</ns6:count>
      </ns6:term>
      <ns6:term>
       <ns6:actualTerm>hdy</ns6:actualTerm>
       <ns6:count>34</ns6:count>
      </ns6:term>
      <ns6:term>
       <ns6:actualTerm>jid</ns6:actualTerm>
       <ns6:count>32</ns6:count>
      </ns6:term>
      <ns6:term>
       <ns6:actualTerm>ajh</ns6:actualTerm>
       <ns6:count>23</ns6:count>
      </ns6:term>
     </ns6:terms>
    </ns6:facet>
    <ns7:facet xmlns="http://docs.oasis-open.org/ns/search-ws/sru-facetedResults" xmlns:ns7="http://docs.oasis-open.org/ns/search-ws/sru-facetedResults">
     <ns7:facetDisplayLabel>meta:jtitle</ns7:facetDisplayLabel>
     <ns7:terms>
      <ns7:term>
       <ns7:actualTerm>Nature</ns7:actualTerm>
       <ns7:count>112</ns7:count>
      </ns7:term>
      <ns7:term>
       <ns7:actualTerm>British Journal of Cancer</ns7:actualTerm>
       <ns7:count>49</ns7:count>
      </ns7:term>
      <ns7:term>
       <ns7:actualTerm>Heredity</ns7:actualTerm>
       <ns7:count>34</ns7:count>
      </ns7:term>
      <ns7:term>
       <ns7:actualTerm>Journal of Investigative Dermatology</ns7:actualTerm>
       <ns7:count>31</ns7:count>
      </ns7:term>
      <ns7:term>
       <ns7:actualTerm>American Journal of Hypertension</ns7:actualTerm>
       <ns7:count>23</ns7:count>
      </ns7:term>
     </ns7:terms>
    </ns7:facet>
    <ns8:facet xmlns="http://docs.oasis-open.org/ns/search-ws/sru-facetedResults" xmlns:ns8="http://docs.oasis-open.org/ns/search-ws/sru-facetedResults">
     <ns8:facetDisplayLabel>meta:subjkeyword</ns8:facetDisplayLabel>
     <ns8:terms>
      <ns8:term>
       <ns8:actualTerm>Cancer</ns8:actualTerm>
       <ns8:count>113</ns8:count>
      </ns8:term>
      <ns8:term>
       <ns8:actualTerm>Clincal medicine</ns8:actualTerm>
       <ns8:count>88</ns8:count>
      </ns8:term>
      <ns8:term>
       <ns8:actualTerm>Development</ns8:actualTerm>
       <ns8:count>85</ns8:count>
      </ns8:term>
      <ns8:term>
       <ns8:actualTerm>Genetics</ns8:actualTerm>
       <ns8:count>85</ns8:count>
      </ns8:term>
      <ns8:term>
       <ns8:actualTerm>Medical Research</ns8:actualTerm>
       <ns8:count>55</ns8:count>
      </ns8:term>
     </ns8:terms>
    </ns8:facet>
   </facetedResults>
==
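
For anyone consuming facet data shaped like the sample above, here is a minimal sketch of a client-side parser. It is an illustration only, not part of the Nature implementation: the function name `parse_facets` and the trimmed `SAMPLE` document are hypothetical, and it assumes only the facet namespace URI and element names that appear in the sample. Matching on the namespace URI rather than the prefix means the auto-generated `ns6`/`ns7`/`ns8` prefixes do not matter.

```python
import xml.etree.ElementTree as ET

# Namespace URI copied from the sample response above.
FACET_NS = "http://docs.oasis-open.org/ns/search-ws/sru-facetedResults"

# Trimmed, hypothetical copy of one facet from the sample (two terms only).
SAMPLE = """\
<facetedResults>
 <ns6:facet xmlns:ns6="http://docs.oasis-open.org/ns/search-ws/sru-facetedResults">
  <ns6:facetDisplayLabel>meta:pcode</ns6:facetDisplayLabel>
  <ns6:terms>
   <ns6:term>
    <ns6:actualTerm>nature</ns6:actualTerm>
    <ns6:count>112</ns6:count>
   </ns6:term>
   <ns6:term>
    <ns6:actualTerm>bjc</ns6:actualTerm>
    <ns6:count>49</ns6:count>
   </ns6:term>
  </ns6:terms>
 </ns6:facet>
</facetedResults>
"""


def parse_facets(xml_text):
    """Return {facet label: [(term, count), ...]} for every facet found."""
    root = ET.fromstring(xml_text)
    facets = {}
    # Qualified names use the {uri}local form, so prefixes are irrelevant.
    for facet in root.iter(f"{{{FACET_NS}}}facet"):
        label = facet.findtext(f"{{{FACET_NS}}}facetDisplayLabel")
        terms = []
        for term in facet.iter(f"{{{FACET_NS}}}term"):
            value = term.findtext(f"{{{FACET_NS}}}actualTerm")
            count = int(term.findtext(f"{{{FACET_NS}}}count"))
            terms.append((value, count))
        facets[label] = terms
    return facets


if __name__ == "__main__":
    for label, terms in parse_facets(SAMPLE).items():
        print(label, terms)
```

The same function should work on a full response document, since `iter` walks the whole tree looking for `facet` elements in that namespace, wherever the `facetedResults` wrapper sits.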

On 6/8/10 16:45, "Ray Denenberg, Library of Congress" <rden@loc.gov> wrote:

See note below.  I would like to respond that I think that faceted search is stable enough and that his strategy seems sound.  --Ray
 

From: Edo Plantinga [mailto:Edo.Plantinga@ictu.nl]
Sent: Friday, August 06, 2010 10:30 AM
To: Denenberg, Ray
Cc: Andre van Brussel; [name removed by OASIS admin]; Michael Rijsdijk; Marco Aarts
Subject: SRU 1.3 / 2.0 and faceted search


Dear Mr. Denenberg ,



Please allow me to introduce myself. My name is Edo Plantinga and I work for the Dutch government. We are currently migrating the search engine behind the national Dutch government website overheid.nl ("government.nl") to a new search engine. We are taking this opportunity to make the indexed data available via an SRU webservice. For our website, we need faceted search as well. I understand that faceted search is not yet a part of the SRU 1.2 definition, but that it is going to be a part of the SRU 2.0 definition.

I saw that you are the chair of the committee that decides on the SRU 2.0 standard. I hope that you can find the time to answer these short questions. If not, perhaps you could be so kind as to refer me to somebody who can.
Given the time pressure this part of our project is currently under, I would greatly appreciate a prompt response.
Yours sincerely,

Edo Plantinga  -  Technical adviser, CVDR

__________
For technical and other questions about the CVDR, please contact the service desk first: (070) 889 61 37 / regelgeving@overheid.nl / CVDR website <http://www.e-overheidvoorburgers.nl/producten,decentrale-regelgeving>
__________
ICTU <http://www.ictu.nl/> - Programma e-Overheid voor Burgers <http://www.e-overheidvoorburgers.nl/> (a merger of the former programmes MijnOverheid and Overheid heeft Antwoord©).
Email: edo.plantinga@ictu.nl / Telephone: 070 - 889 6276 / Mobile: 06-2498 1976




