OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

egov-comment message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Public Comment


Comment from: mail@makxdekkers.com

The following is the response of the DCMI Directorate in the public 
review of the document: 

Search Service Interoperability
Working Draft 01, 17 November 2003
Document identifier: wd-egov-searchservice-01
Location: http://www.oasis-open.org/egov/docs/
Editor: Eliot Christian, U.S. Geological Survey <echristi@usgs.gov>

-------------------------------------------------------------------- 

The basis for the document seems to be that a "search service" is 
the single way that Governments can create interoperability between 
government organisations and with external users of government 
information ("Governments must offer to intermediaries an 
information search interface").

However, the document does not make clear the difference between (to 
paraphrase): "search services/interfaces" as "database or Web 
applications designed to help people find information" as opposed to 
"search services/interfaces" as "specifications for structured 
messages passed by protocols". It therefore does not make clear that 
"search services" (in meaning #1) would need to run special software 
to support the "search services" of ISO 23950 (meaning #2).

The underlying model -- not explicitly stated -- seems to be that 
search systems are implemented (only) through real-time, on-line 
connections between Clients and Servers. Although this approach has 
been proven to work in one-to-one communications, the document does 
not acknowledge that there are known problems with searching across 
multiple sites, as would be required for multiple government 
organisations providing different (and possibly overlapping) parts of 
the information that is being sought.  (For example: multi-site 
search mechanisms are only as strong as their weakest link, so 
sophisticated error handling is needed, and searches can result in 
incomplete or confusing results, for example when update frequencies 
are not coordinated or when servers are temporarily unavailable.)

The document need not (and probably should not) discuss such
technical problems in detail.  However, it should at a minimum
distinguish search services in general from client-server-based
search services in particular and explain that such services
presuppose an installed base of special software on both the client 
and server sides for handling queries in real time.

In this context, the document should at least acknowledge "metadata 
harvesting" as an alternative and widely implemented approach to 
implementing interoperable search over distributed bodies of data.  
Ideally, the document would explain that an emerging practice of 
"institutional repositories" uses the Open Archives Initiative 
Protocol for Metadata Harvesting to compile centralised indexes for 
searching by harvesting metadata records from multiple servers that 
do not themselves need to support "search services" (in meaning #2) 
with special software.  Ideally, the document would also acknowledge 
that this approach addresses some of the problems with multi-site
searching mentioned above.

Taking a broader view of the requirements for searching and 
interoperability would provide a context for correcting the 
document's depiction of Dublin Core.  In the current draft, Dublin 
Core is described only as a set of tags for "embedded metadata", 
while the Open Archives Initiative is mentioned only as something 
which uses "this style of metadata". This seems doubly misleading 
because Dublin Core metadata may not just be embedded in Web 
documents, but is more often stored separately from the data itself, 
while the OAI Protocol is in fact not designed to do anything with 
embedded metadata at all. On the contrary, the OAI approach is 
based on the harvesting and merging of metadata records -- an 
approach which has, in turn, underscored the need for standards for 
describing resources.

Dublin Core, then, could be better characterized not as a "profile" 
of "meta element names" for use in "embedded metadata", but as a 
standardised set of semantics intended for merging into search 
services (in meaning #1) such as those provided for institutional 
repositories -- or at any rate in the context of "Metadata for 
Digital Objects" as opposed to just "Metadata within Digital 
Documents".

The section on Service Registries (Section 1.4.5), finally, could 
perhaps mention registry-building efforts within the Dublin Core 
community to help communities of practice converge on interoperable 
semantics.

Kind regards, Makx Dekkers.

--------------------------------------------------------------------
Makx Dekkers                                    mail@makxdekkers.com
Managing Director                    Dublin Core Metadata Initiative


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]