OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

unitsml message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: FW: [chairs] Automated checking of OASIS schemas and NDR alignment


Dear UnitsML TC,
 
Since I've put this topic on the agenda, I thought it appropriate to send this message from David Webber (that he sent to the OASIS chairs) to the UnitsML TC.
 
Sincerely,
Bob Dragoset
 

Robert A. Dragoset, physicist
Physics Laboratory
National Institute of Standards and Technology

NIST
100 Bureau Drive, Stop 8400
Gaithersburg, MD 20899
Phone: 301-975-3718
Email: dragoset@nist.gov
Website: http://physics.nist.gov


From: David RR Webber (XML) [mailto:david@drrw.info]
Sent: Saturday, August 08, 2009 12:20 PM
To: Matthew Dovey
Cc: Chris Kaler; Kelvin Lawrence; Mary McRae; Robin Cover; chairs@lists.oasis-open.org
Subject: [chairs] Automated checking of OASIS schemas and NDR alignment

Since we are on this topic - the real issue is that XSD schema is a double edged sword - fabulously complex and hence challenging to desk check manually.

We could consider alternative strategies to improve quality - I would suggest a voluntry approach here.

For those that care to use it the OASIS CAM specification provides support for automated verification of XSD schema.

The open source toolkit is available on SourceForge.net in the camprocessor project.

Essentially what is does is ingest the XSD Schema - formulate it using the CAM template as an ABSTRACTION LAYER - that then allows automated inspection by particularly xslt scripting.

The following OASIS specifications have already benefited from this - EDXL, CIQ, and EML and then non-OASIS spec's including PESC, MISMO and then the NIEM.gov work.  The toolkit has been able to detect errors that in some cases have laid dormant for 3 years in published standards.

Here's a brief check list of what the toolkit will check for you -

1) Non-UTF8 characters in the schema

2) Naming and design rules consistency - currently this is drawn from NIEM and CEFACT NDR best practices - but is fully customizable in XSLT scripting

3) Common issues WRT Schema and interoperability (a major OASIS goal?) that are flagged as warnings

4) Build XML dictionary of your schema - that can be loaded into Excel spreadsheet - invaluable in manually inspecting actually what is being used where and the information model itself - and what is new release to release.

A sample of the output of the evaluator is below for one schema from OASIS EML - checking 1,500 item and 3,000 rules - clearly a challenge manually.  

You can see the gap here this illustrates between existing standards (in this OASIS EML case dating back to 2001) and todays expected quality evaluations.  Closing this gap while supporting an existing user community is obviously a challenge - but in EML case - we are at least starting that journey.

I believe it is inevitable that OASIS will move in this direction - automated checking - because the users out there demand ever better quality in OASIS specifications - and also expect industry best practices to be followed when engineering schema - which points to OASIS NDR - and aligning that with existing practice such as NIEM.gov.  And of course eventually we can expect at least government level users to insist on this before approving use of a standard - because internally they are already using these evaluation tools on their own schemas.

Thanks, DW

CAM Template Evaluation

Version 1.16

CAM Template HEADER information:

Description: EML 150 Geodistricting schema template
Owner: OASIS, Copyright 2009.
Date: 2009-08-04T08:02:22
Version: 6.0

RULES INTEGRITY:

NAMING AND DESIGN RULES (NDR) ASSESSMENT:

ISSUES AND WARNINGS:

EXTERNALS:

   Namespace URL


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]