OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

codelist message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Re: [codelist] code list validation


On Thu, 07 Sep 2006 21:30:22 +0100, Ray Denenberg, Library of Congress  
<rden@loc.gov> wrote:

> You could say:
> <xsd:restriction base="xsd:string" codelist="http://....."/>
>
> There is no such syntax yet (that I'm aware of), most likely because  
> there is no standard codelist format yet. But it might be as simple as  
> above --
> defining a 'codelist' attribute on the restriction element within the  
> schema language. Of course it would need to be accepted into the  
> language by the
> W3C.

There are a couple of issues here that I want to mention.  Assuming that  
the TC takes genericode as its starting point, genericode supports (a)  
multiple keys for a code list, and (b) compound keys for a code list.  So  
there is no automatic way you can point to a code list definition and  
extract "the" set of codes.  Now, I expect 80% of code lists to only have  
a single simple (non-compound) key, so in many cases it could work, but it  
wouldn't work 100% of the time.  There would need to be a plan to cover  
the inevitable failures when it didn't work.

On a more philosophical level, one of the intentions of genericode was to  
define a format for code lists that wasn't tied to any particular  
implementation of a code list validation engine.  This differs from the  
philosophy of things like W3C XML Schema; I usually characterise it as a  
control language for XML validation engines, and not as a modelling  
language.  When you try and tie something too directly to the engines that  
implement it, you are forced to take implementation issues back into the  
format, and that makes it less generic.

I would like to see the TC produce a code list format that can be applied  
equally to Schemas, Java/C#/etc., databases, i.e. everything.  That  
requires a certain standoffishness in terms of not getting too closely  
tied to way any particular methods that these technologies might use for  
implementing code list validation.

My preferred approach would be for us to look at producing some reference  
tools for generating Schemas, Java enumeration classes, database tables,  
etc. from our code list representation format.  For example, for Schemas,  
a simple XSLT stylesheet could do the job for most cases, and it could  
generate a warning for cases that it couldn't handle.  The same could be  
done one way or another for the other targets too.

If groups like the W3C *want* to try and build direct support into their  
works, I wouldn't try to stop them, but I would be very cautious if  
implementers were pushing back for features that were only required to  
support implementation, and not required for the generic representation of  
code lists.  In a similar vein, I wouldn't be inclined to start opening  
discussions with the W3C until we at least a reference Schema generation  
solution in place, just so we don't start the external discussions before  
we have sufficient maturity in what we are doing.  Whenever you get into  
these external discussions, the external parties always ask you if you can  
do new things, and do them their way.  If you don't have sufficient  
maturity, you can end up agreeing to things that have a negative impact in  
the long run.  I suggest we make sure we are all happy ourselves before  
kicking off any outreach stuff.

How does that sound?

Cheers, Tony.

PS  If the W3C were to implement some kind of direct support for  
genericode-style codelists, I think it would be better done using a  
"key/keyref" style mechanism, rather than an "enumeration" style  
mechanism, but there would still be many questions to resolve.  I just  
can't see it happening until such time as the Code List Representation  
format has some marketshare and mindshare to speak of.
-- 
Anthony B. Coates
Senior Partner
Miley Watts LLP
Experts In Data
+44 (79) 0543 9026
Data standards participant: ISO 20022 (ISO 15022 XML), ISO 19312,  
UN/CEFACT TMG, MDDL, FpML, UBL.


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]