legalruleml message

Subject: Revised Section 5

From: Tara Athan <taraathan@gmail.com>
To: legalruleml@lists.oasis-open.org
Date: Mon, 2 Nov 2015 12:45:20 -0500

Hi - I have revised section 5 of the working draft with an expandeddescription of the syntax design which is (or should be!) consistentwith the current schemas. With the latest changes that simplified thesyntax and schemas, it became quite a bit easier to give a coherentdecription of the design, although the design is still fairly complex.The new content is the text below. @@@'s denote places where somethingis missing. Most of them are bibliographic or cross-references, but someare issues that merit discussion.


Other new things in the commits

* The build script has been significantly enhanced in order to test theschemas more than before, especially in regard to preserving theabstract syntax (ie. the RDF) when performing transformations, such asconverting between compact and normalized serializations, on instances(our fixed examples and randomly generated instances from the schemas).It now takes over an hour for the build to run, what with all the testing.* In addition to parsing to the RDF form that terminates at thetemplates, I have applied the same conversion principles to the templatecontents, to ensure that the information within the templates is alsopreserved when the transformations are carried out.* metamodel RDFS documents were updated to include the newLegalReference(s) and Source(s) entities. I have not yet updated thediagrams.


Tara

5 LegalRuleML XML Design Principles (non-normative)

The concrete XML-based syntax for LegalRuleML was designed based on theprinciples in Sec. 2.3, as well as certain design principles that arespecific to XML-based syntaxes (section 5.1) and additional designprinciples (sections 5.2-5.9) that are domain-specific. In particular,many of the XML conventions developed in RuleML are adopted inLegalRuleML, providing common principles for the merged languagehierarchy. All statements herein about the RuleML syntax are inreference to the elements in the RuleML namespace that are allowed to beembedded within LegalRuleML documents; as such, these are restrictionsfrom the more general RuleML syntax as well as extensions in regard tochild elements in the LegalRuleML namespace.


5.1 XML Elements vs. Attributes

A common design decision for XML-based languages is whether to use anXML element or an attribute to represent a particular abstract syntacticfeature. General guidelines are:– If the information in question could be itself marked up withelements, put it in an

element, because attributes cannot contain such complex content;

– If the information is suitable for attribute form (i.e., not complex),but could end upas multiple attributes of the same name on the same element, use childelements

instead, avoiding list datatypes for attributes;

– If the information is required to be in a standard XML schemaattribute type such as

ID, IDREF, ENTITY, KEYREF, use an attribute;

– If the information should not be normalized for white space, useelements (XMLprocessors normalize attributes in ways that can change the raw text ofthe attribute

value.).

5.2 LegalRuleML Syntactic Requirements

The following syntactic characteristics were deemed mandatory for theLegalRuleML syntax:1. An abstract syntax for LegalRuleML must be described by an RDFSmetamodel.2. Two equivalent XML-based concrete serializations of the abstractsyntax must be specified: the normalized serialization and the compactserialization. Each constraint of the specification must be in one ofthe following formats: Relax NG grammar (RNC format), XSD 1.0 schema, ornatural language statement.3. Parsing from either LegalRuleML concrete serialization to theLegalRuleML abstract syntax in RDF/XML format must be specified by acomposition of XSLT transformations.4. A pair of abstract-syntax preserving XSLT transformations, called thecompactifier and the normalizer, must convert LegalRuleML documentsbetween compact and normalized serializations.5. The conformance level of a document must be preserved by thecompactification and normalization transformations. I.e., anXSD-conformant document must still be XSD-conformant aftertransformation, and similarly for RNC-conformance.


5.3 Syntactic Objectives

The following syntactic characteristics were deemed desirable for theLegalRuleML syntax, but could not all be simultaneously satisfied. TheLegalRuleML syntax was designed to optimize over these characteristicsto the extent possible:1. maximize correspondence to the RDF-based abstract syntaxrepresentation in the normalized serialization.

2. minimize verbosity, especially in the compact serialization.

3. minimize redundancy of expression, avoiding multiple ways to expressthe same thing.4. minimize the difference between the syntax defined by the RNC and XSDschemas.5. minimize the additional constraints not expressible in either RNC orXSD schemas.6. minimize the additional constraints (from #5) not expressible throughabstract-syntax preserving validating XSLT transformation.7. (related to 5 and 6) minimize discrepancies after round-triptransformation between the compact and normalized serializations ofinstances that validate against RNC and XSD schemas.

8. minimize the modifications to imported RuleML schemas

9. minimize the set of schema-conformant instances that do not satisfy around trip law between serializations after projection by theabstract-syntax preserving validating transformations.10. minimize the modifications that are necessary in the projections (asdescribed in #9) to instances that satisfy the round-trip laws.


5.4 Node and Edge Element Dichotomy

In order to satisfy objective 5.3.1, LegalRuleML adopted, for itsnormalized serialization, a form of striped syntax, where Node elementsalternate with edge elements, forming a bipartite pattern, similar tothe striped syntax of RDF/XML. The striped syntax of LegalRuleMLnormalized serialization is intended to represent explicitly theentity-relationship structure of the abstract syntax.

The LegalRuleML schemas specify two groups of elements: Node (alsocalled type) elements and edge (also called role) elements, the elementname of the former starting with an upper case letter, and the latterwith a lower case letter. The one exception to this pattern is the<ruleml:slot> element, which is neither a Node or edge element.

Node elements correspond to classes of the metamodel while edge elementsrepresent relationships between members of these classes. Edge elementscorrespond, in most cases, to properties in the metamodel. In a fewcases, edge elements correspond to compositions of such properties. Theruleml:slot element also corresponds to the structure of the abstractsyntax, with the first child of the slot corresponding to the propertyof a triple and the second child corresponding to the object of the triple.

In some cases, the metamodel is sufficiently restrictive so that theedge element provides no additional information, allowing for a losslessconversion from the normalized serialization to an XML representationthat is less verbose by simply deleting the start and end edge tags. TheLegalRuleML compact serialization is defined in this way.

In the XML document tree, elements that have no children are calledbranch elements, otherwise they are called leaf elements. Element typesmay be classified according to whether their instances are all leafelements (Leaf type), all branch elements (Branch type) or either(Leaf/Branch type).


5.4.1 Node Elements
The naming convention for Node elements is UpperCamelCase local names.

The qualified name of a Node element corresponds to the type of thesyntactic construct defined by the Node element, i.e., an rdf:typerelationship in the RDF-based abstract-syntax representation (@@@ referto metamodel section). The IRI of the metamodel class is constructed byconcatenating local name of the Node element with the appropriate IRIprefix:* http://docs.oasis-open.org/legalruleml/ns/v1.0/metamodel# for Nodeelements in the LegalRuleML namespace* http://docs.oasis-open.org/legalruleml/ns/v1.0/rule-metamodel# forNode elements in the RuleML namespaceWe use the prefixes lrmlmm and rulemm, reps., to abbreviate themetamodel IRIs. At the time this document was published, the RuleMLspecification did not provide a metamodel, but a RuleML metamodel isunder development @@@ref.


Classification of Node Elements

Collection Node element: In general, a Collection Node element is a Nodeelement that defines a syntactic construct which is a collection. InLegalRuleML’s RDF-based metamodel, these constructs are of typerdf:List. The naming convention of Collection Nodes in LegalRuleML is touse the plural of the type of the members of the collection. Forexample, a collection for constructs of type lrmlmm:Authority isspecified with an lrml:Authorities element. RuleML has no Collection Nodes.

Document Node element: In general, a Document Node element is a Nodeelement that can serve as the root node of an instance document. InLegalRuleML, the element lrml:LegalRuleML is the only Document Node element.

Annotation Node element: In general, an Annotation Node element containsmixed content, and is intended to hold marked-up text. In LegalRuleML,the Annotation Nodes are the Nodes lrml:Comment and lrml:Paraphrase.RuleML has no Annotation Nodes.

In general, Node elements may have Leaf, Branch, or Leaf/Branch types.In the LegalRuleML namespace, all Nodes types are Leaf/Branch type,while in the RuleML namespace, Nodes types are mostly Leaf or Branchtypes, with a few exceptional Leaf/Branch types.

Every LegalRuleML Node element may optionally have a child element thatattaches a comment to it, specified in an lrml:Comment element (seesection @@@Comment/Paraphrase).

The types of RuleML Branch or Leaf/Branch Node element have beenextended in the LegalRuleML syntax so that RuleML elements within aLegalRuleML document may optionally have a child element that attaches aparaphrase to it, specified in an lrml:Paraphrase element (see section@@@Comment/Paraphrase).

A group of common optional attributes for most LegalRuleML Node elements(called in the schemas commonLRMLNodeInit.attlist) are the following:

* @key
* @keyref
* @type

with the exception of <lrml:Reference> and <lrml:LegalReference>, whichare not allowed to have these attributes. See section @@@Key/Keyref fordetails of the usage of @key and @keyref attributes, and @@@MetamodelRefinement for details of the usage of #type.

Common optional attributes for most RuleML Node elements withinLegalRuleML documents are

* @key
* @keyref
* @xml:id

The @key and @keyref attributes have a different content model than thecorresponding attribute in LegalRuleML elements (see section@@@Key/Keyref). The usage of the @xml:id attribute is described in@@@Identifiers

The root element of every LegalRuleML document is a Node element (inparticular, <lrml:LegalRuleML>). This root element may optionally havethe following attributes:

* @xml:base
* @hasCreationDate
* @xsi:schemaLocation

in addition to the common optional Node attributes. The semantics of@xml:base and @xsi:schemaLocation are defined by the @@@XML and @@@XSDspecifications, respectively. The @hasCreationDate attribute hassemantics related to Dublin Core’shttp://dublincore.org/documents/dcmi-terms/#terms-created, except thatthe Dublin Core property takes a literal value, while @hasCreationDatetakes a local identifier reference to a ruleml:Time entity.

Specialized attributes may be optional or required for a subset of Nodeelements, as follows:

* @pre, on Prefix
* @refID, on Prefix, Reference or LegalReference
* @sameAs on Source, LegalSource, Agent, Authority, Jurisdiction

* @iri on Annotation Nodes, Role, LegalRuleML Deontic Nodes and DeonticKey Nodes (@@@see Identifiers)

* @refersTo (on Reference and LegalReference)

* @refType, @refIDSystemName, @refIDSystemSource (on Reference,References, LegalReference, LegalReferences)

* @memberType (on Collections)
* @hasCreationDate (on LegalRuleML and Context)
* @strength
* @over, @under

Additionally, @xml:base is allowed on ruleml:Data elements with anexplicit datatype of xsd:anyURI.


5.4.2 Edge Elements
The naming convention for Node elements is lowerCamelCase local names.

Classifications of Edge Elements

Collection Membership Edge: In the LegalRuleML namespace, collectionmembership edges are the children of Collection Nodes (definingsyntactic constructs of type lrmlmm:Collection) that define themembership of the collection. The local names of these edges begin with‘has’, followed by the name of the collection member type. For example,the collection edge for a lrml:Authorities collection islrml:hasAuthority - the parent of an lrml:hasAuthority element is alwayslrml:Authorities, and its child is always lrml:Authority. Englishgrammar conventions are followed when relating the plural form used inthe name of the collection with the singular form used in the collectionedge. Note that not all edges whose local name begins with ‘has’ arecollection edges. In the RuleML namespace, an edge is a collection edgeif and only if it has an @index attribute. The local names of RuleMLcollection edges are ‘arg’, ‘content’ and ‘formula’. The first two arealways collection edges, while ruleml:formula is only a collection edgewhen its parent is ruleml:And, ruleml:Or, ruleml:Operation, orlrml:SuborderList.

Document Edge: In LegalRuleML, document edges are the edges whose parentNode element is the Document Node element, the root of the XML document.The local names of these edges begin with ‘has’, followed by the name ofthe (unique) child element.

Annotation Edge: In LegalRuleML, annotation edges contain an AnnotationNode element. The local names of these edges begin with ‘has’, followedby the name of the (unique) child element.

The types of edge elements may be classified by syntactic type as Leaf,Branch or Leaf/Branch types. RuleML edge elements have only Branchtypes, while LegalRuleML edge elements have mostly Leaf (see section@@@Leaf Edge Elements) or Branch type (see section @@@Branch EdgeElements), with a few exceptional Leaf/Branch types (see section@@@Leaf/Branch Edge Elements).

The qualified name of an edge element, in most cases, corresponds to aproperty of the syntactic construct defined by its parent Node element,i.e., the property of a triple in the RDF-based abstract-syntaxrepresentation (@@@ refer to metamodel section). The IRI of themetamodel property is constructed by concatenating the local name of theedge element with the appropriate IRI prefix:* http://docs.oasis-open.org/legalruleml/ns/v1.0/metamodel# for Nodeelements in the LegalRuleML namespace* http://docs.oasis-open.org/legalruleml/ns/v1.0/rule-metamodel# forNode elements in the RuleML namespacewith the exception of collection edges. The order in the collection isspecified by the order of the sibling collection edges in theLegalRuleML document.

Edge elements may be classified as skippable or non-skippable, relativeto the syntax. In the LegalRuleML namespace, it is exactly theBranch-type edge elements that are skippable, while Leaf-type andLeaf/Branch-type elements are non-skippable. Branch-type edges are thefollowing:

* collection edges
* document edges
* annotation edges
* the edges lrml:hasTemplate, except within lrml:FactualStatement

The RuleML edge elements that are considered skippable withinLegalRuleML documents are the following:

* lrml:arg
* lrml:op
* lrml:formula
* lrml:declare
* lrml:strong
* lrml:weak
* lrml:left
* lrml:right
* lrml:torso

A group of common optional attributes for non-skippable LegalRuleML edgeelements (called in the schemas commonLRMLEdgeInit.attlist) is defined,and contains only the following:

* @xml:id

The value of this attribute provides an identifier for the correspondingtriple in the RDF-based abstract syntax representation.

Leaf-type edge elements have a required attribute that points to theobject of the relationship they define. If the object is required tohave a local identifier, then @keyref is the required attribute,otherwise it is @iri. Note that the Source, LegalSource, Reference, andLegalReference constructs are provided so that external resources can bealiased with a local identifier that may then be used as the value of an@keyref attribute.



5.5 Generic Node elements

A generic element is a main element whose syntax and/or semantics isunderspecified unless an attached attribute attribute or header elementprovides a predefined value or an IRI pointer to a user-definedsignature. For example, <ruleml:Operation> is a generic connectiveoperator, which may be used for modal operators or logical connectivessuch as exclusive disjunction. Generic elements provide extension pointsfor user-defined syntactic and semantic variation. The following tableprovides a listing of Generic Node elements and the attributes or headerelements that may be used to specialize them.


ruleml:Operation   @type
ruleml:Negation    @type
ruleml:Rule        @strength, hasStrength

5.6 Serializations

Two equivalent normative serializations are defined in the Relax NG andXSD schemas – a normalized serialization and a compact serialization.

5.6.1 Normalized Serialization

In many cases, edge elements are redundant because they could bereconstructed based on the type or position of the parent and child nodeelements. RuleML syntax allows such edges to be optionally skipped,resulting in its stripe-skipped serialization. LegalRuleML syntax allowsthe two extreme cases - either no edges are skipped in the document (thenormalized serialization) or all skippable edges in the document areomitted (the compact serialization). The normalized serialization may bereconstructed from a document in compact serialization by applying thenormalizer XSLT transformation, which reconstructs the skipped edges.


5.6.2 Compact Serialization

The compact serialization of LegalRuleML reduces verbosity without lossof information.The compact serialization may be derived from the normalizedserialization by removing the start and end tags of skippable edgeelements.The compact serialization may be obtained from a document in normalizedserialization by applying the compactifier XSLT transformation.

Note that RuleML has a relaxed serialization that allows edges to beoptionally skipped, and also allows a (mostly) arbitrary ordering ofchild elements. RuleML in the relaxed serialization is not allowed to beembedded within LegalRuleML – the embedded RuleML must be in eithernormalized or compact serialization, consistent with the serializationof the parent LegalRuleML.


5.7 General Design Patterns
Inside of LegalRuleML we employ five well-known design patterns:

container, which is a structure of elements having independent existence(e.g.,<Context> can include several <Association> sub-elements);collection, a subpattern of container that is in the form of a list ofelements of the same type (e.g., <Roles> that is a sequence of <Role>elements);recursive element (e.g., <Obligation> can include other <Obligation>elements);marker, an element that uses attribute @sameAs for identifying a source,e.g., <lrml:LegalSource key="sec504-clsa-pnt1"sameAs="&UScode;#title17-chp5-sec504-clsa-lst1-pnt1"/>composite elements that are made up of different dependent parts, (e.g.,a rule <Rule> consists of an antecedent <if> and conclusion <then>).


5.7.1 Collection Design Pattern

LegalRuleML uses a collection design pattern for organizing andefficiently representing and referring to metadata.The lrmlmm:Collection class in the LegalRuleML metamodel is thesuperclass for these syntactic constructs, which is in turn a subclassof ref:List.

The lrmlmm:hasMember property

The name of the collection element indicates the type of its members.

Properties can be assigned to all members using an attribute on orheader child element within the collection element.Metadata collections must occur in a prescribed order in a LegalRuleMLdocument.


5.7.2 Recursive Element Pattern

The RuleML syntax uses recursive elements, i.e. elements that may havedescendants of the same name, to represent the inherently recursivenature of logical connectives and functional expressions. LegalRuleMLintroduces some specialized logical connectives which are similarlyrecursive, as follows:

* lrml:Obligation
* lrml:Permission
* lrml:Prohibition
* lrml:Right
* lrml:SuborderList

In order to reduce redundancy through modularization, some of thecollections elements are recursive, as follows:

* lrml:Sources
* lrml:LegalSources

Similarly, the element that is used to efficiently construct contextualrelationships is recursive to facilitate modularization:

* lrml:Association

5.7.3 Marker Interface Pattern

The marker interface pattern is used in programming to annotateentities. In LegalRuleML, external entities are in many cases requiredto be aliased with a local identifier, which may then be referenced asthe subject or object of annotations. The syntax that implements themarker interface pattern in LegalRuleML consists of the followingattributes:

* @key
* @keyref
* @sameAs
* @refersTo
* @refID

5.7.4 Composite Element Pattern

@@@This is a foundational design decision for the metamodel. There arepros and cons to the composite element pattern.The parser to the abstract syntax is currently written in a way thatdoes *not* invoke the composite element pattern.In particular, if there is a Node element with a @keyref attribute andno content, a triple is created that has the IRI from that keyref asobject - a clone of the object is *not* created.

As a consequence, a formula could be the head of two different rules.

If we have the composite element pattern, then each rule owns its head(and body), so the two different rulesmust have two different heads, even if they are occurrences of the sameproposition.

@@@

5.8 Specialized Design Patterns
5.8.1 Ordered-Children Design Pattern

In the normalized serialization, when the order of children issignificant to the semantics of the parent Node element, an indexattribute is required on the edges so that the order is made explicit.In the compact serialization, the edge elements that would have an indexattribute are skipped, so that the order of occurrence of children inthe XML document is significant.

Example: SuborderList
5.8.2  Leaf edges

LegalRuleML introduces a syntactic pattern that is not present in RuleML– the leaf edge element:

 is an edge element that is empty
always has at least one attribute, typically @keyref
See stripe_leaf_module.rnc (Stripe Required, Leaf Not Obligatory

and Stripe Optional, Leaf Not Obligatory) andstripe_required_module.rnc (Leaf Obligatory)

5.8.3 Slot Design Pattern

LegalRuleML adopts the slot design pattern as implemented in RuleML forexpressing properties of deontic formulas. This design pattern comesfrom frame language and serves to store information about a frame as a"property-value" pair.


5.9 CURIES, Relative IRIs and the xsd:ID Datatype

• LegalRuleML employs a variety of syntactic forms for labelingcomponents with identifiers, and for referring to these or otheridentifiers. In this section, we discuss the syntactic forms that arebased on the IRI system, and compare to the corresponding forms employedin RuleML.

Need overview of prefixing and abbreviation of IRIs
qualified names (<ruleml:Rule>, “xs:integer”)
xsd:ID datatype (key = “rule1”)
same-document reference (keyref = “#rule1”)
relative IRIs other than same-document reference (“../otherdoc.lrml#rule2”)
CURIEs (iri=”ex:servb/otherdoc.lrml”, keyref=”:#rule1”)
IRI (iri=”http://servb/otherdoc.lrml”;)
CURIE datatype follows RDFa (@@@citation)

conflicting ID-types for attribute “key” of element“TemporalCharacteristic” from namespace“http://docs.oasis-open.org/legalruleml/ns/v1.0/”;


5.10 Distributed Syntax

The @key and @keyref attributes are used to enable the distributeddefinition of syntactic constructs in both RuleML and LegalRuleMLnamespaces. The @key attribute supplies a local identifier for thesyntactic construct, and . In particular, the value of the @keyattribute, of datatype xs:ID, is concatenated with the base IRI of theLegalRuleML document (as specified by the xml:base attribute on theroot, if present, and otherwise determined according to @@@) toconstruct the IRI of the syntactic construct defined by the parent Nodeelement of the @key attribute.

The @keyref is used to reference the local identifiers defined by @keyattributes. It is a syntactic error for a @keyref attribute to have avalue that does not correspond to a local identifier in the same document.

When @keyref occurs on an element with no attributes or content, then itis reference to the syntactic construct having that local identifier.When @keyref occurs on a parent element that has attributes and/orcontent, it refers to a new syntactic construct that is based on thereferenced construct, modified by the attributes and/or content of theparent. Such elements are translated into the abstract syntax using thelrmlmm:mergerOf property.

Taking into account the simultaneous occurrence of @key and @keyrefattributes on the same element, these references form a directed graphwhich must be acyclic; it is a syntactic error if this graph is not acyclic.


5.11 Metamodel Refinement

The @type attribute is used to refine the semantics of LegalRuleMLelements by reference to external resources. An @type attribute on anyelement is translated into the abstract syntax as an RDF triple withproperty rdf:type. From this, it may be inferred that the resource is anrdf:Class, and the syntactic construct defined by the element on whichit occurred is an instance of that class, as well as an instance of theclass of the LegalRuleML metamodel corresponding to the element’s name.Note that the @type attribute has quite different semantics on RuleMLelements (see @@@RuleMLSpec).


5.12 Annotations - Comment and Paraphrasre

5.13 Identifiers - @xml:id and @iri

An @iri attribute on a Node element correspond to an owl:sameAsrelationship in the abstract syntax.


5.14 Relax NG Schema Design

The normative definition of the LegalRuleML syntax is provided bymodular Relax NG schemas.

5.14.1 Modules

The Relax NG schema modules are written in the “chameleon” style,without specifying a target namespace, to maximum the potential for re-use.The LegalRuleML modules follow the monotonic design pattern (citation@@@) developed for the RuleML 1.0 Relax NG schemas and again employed inRuleML Version 1.01 and 1.02, for best compatibility with the includedRuleML modules.This design pattern is based on restricting the Relax NG syntax in amanner that guarantees monotonicity when schema modules are mixedtogether. That is, a language defined by a subset of the modules ofanother language will be a sublanguage of it.


5.14.3 Drivers
Core
Compact and Normal
LRML Drivers

5.15 XSD Schema Derivation
5.5.1 Alternate Drivers

To accomplish the automated conversion from Relax NG to XSD, alternatedriver schemas were constructed (lrml4xsd-compact and lrml4xsd-normal).These schemas differ from the normative Relax NG schemas only in thefollowing ways:* inclusion of a different module (modules-xsd/id_datatype_ID) definingthe type of the key attribute in LegalRuleML elements to be xsd:ID.* inclusion of a different module (modules-xsd/time4xsd) defining thetype of <ruleml:Data> within <ruleml:Time> to be xs:any.* inclusion of a different module (modules-xsd/stripe_required_4xsd)defining the Leaf/Branch-type edge elements by a lenient pattern that isexactly expressible in XSD

* inclusion of a modified RuleML schema suitable for conversion to XSD.

5.15.2 Alternate Relax NG Modules
id_datatype_ID
time4xsd

5.15.3 Conversion using Trang

The Trang software(https://code.google.com/p/jing-trang/downloads/detail?name=trang-20091111.zip)was used to convert the Relax NG schemas into XSD, selecting he optionsto disable abstract elements and select lax processing of elements oftype xs:any.

5.15.4 Post-processing with XSLT

Due to differences in the expressivity of the Relax NG and XSD schemalanguages, and the particularities of the Trang software used to makethe conversion, some post-processing of the generated XSD was necessaryto obtain a valid XSD schema that appropriately approximates theoriginal Relax NG schemas. The post-processing was accomplished withXSLT transformations @@@/xslt/compact-rnc2xsd.xslt and@@@/xslt/normal-rnc2xsd.xslt.


5.16 Differences between RNC and XSD Schemas
5.16.1. xsi:type

XSD schemas allow no constraint on the appearance of the xsi:typeattribute (as in http://www.w3.org/TR/xmlschema-1/#no-xsi), nor may theyalter the definition from the definition built into XSD schemas (@@@refXSD Schemas). As a consequence, to be XSD valid, the value of anyxsi:type attribute must correspond to some predefined type (e.g.xsd:string) or a user-defined type in the schema, such as the RuleMLcomplex types, e.g. ruleml:integer that permits attributes (e.g. key) onthe Data element while still constraining the content to be of typexsd:integer. Further, XSD validation requires that the attributes andcontent of the element on which the xsi:type attribute appears mustconform to that specified type definition.

RNC schemas intentionally treat an xsi:type attribute just like anyother attribute, so it must be explicitly implemented. It is notpossible to implement the xsi:type attribute in Relax NG in a way thatis equivalent to its nature in XSD schemas.

RuleML uses the xsi:type attribute on the <ruleml:Data> element tomanage the datatype. The RuleML RNC schemas implement a limited form ofthe xsi:type attribute on <ruleml:Data>, such that only certain typesare allowed. In particular, the XSD datatypes are allowed, anduser-defined datatypes in the RuleML namespace are implemented whichallow attributes on the ruleml:Data element in addition to simplecontent according to a particular XSD datatype. Otherwise, RuleML doesnot permit the use of xsi:type on elements in the RuleML namespace, aconstraint enforced by the normative RNC schemas.

LegalRuleML introduces no additional elements where the use of xsi:typeis appropriate, and does allow embedded <ruleML:Data> with xsi:typeattributes to be validated by the LegalRuleML RNC, through importing thecorresponding RNC schema module. In addition, LegalRuleML derives arestricted <ruleml:Data> element for use in temporal characteristics.This use of xsi:type is fully supported in the XSD schemas by default,but the XSD schemas are not able to express the constraint against otheruses of xsi:type attributes, and thus are lenient in this regard,relative to the RNC schemas.


5.16.2. xsi:schemaLocation

Like other attributes in the xsi namespace, XSD schemas may notconstrain the occurrence of the xsi:schemaLocation attribute or alterthe definition from the definition built into XSD schemas (@@@ref XSDSchemas).

RNC schemas treat xsi:schemaLocation just like any other attribute.RuleML implements the xsi:schemaLocation attribute in the RNC schemas,and allows it to appear in any element.

For LegalRuleML, the occurrence of the xsi:schemaLocation attribute on askippable edge causes problems in regard to objective #7, due toinability to reconstruct the attribute because it is deleted along withthe edge tags during compactification. In actuality, there does notappear to be any usecase for the xsi:schemaLocation attribute on anyelement other than the root element of the LegalRuleML document. Forthis reason, the xsi:schemaLocation attribute is implemented in theLegalRuleML RNC schemas in this restricted fashion, and the RuleMLmodule that implements the xsi:schemaLocation attribute on elements inthe RuleML namespace is not included. This is a sacrifice of objective#4 in favor of objective #7.

Therefore, the Relax NG schemas are more restrictive than the XSDschemas in this regard.


5.16.3. xsi:nil and xsi: noNamespaceSchemaLocation

Again, XSD schemas allow these attributes to occur anywhere. In bothLegalRuleML and RuleML, these attributes are not defined in the RNCschemas, and so are not permitted anywhere in instances that meet theRNC conformance criterion. Again, the Relax NG schemas are morerestrictive than the XSD schemas in this regard.


5.16.4. xml:base

Attributes in the xml namespace do not have built-in definitions ineither XSD or RNC schemas, and so must be explicitly defined if theiruse is desired.

In RuleML, the xml:base attribute may appear only on the <ruleml:Data>element, where it may be used in the resolution of a data value that isa relative IRI.

In LegalRuleML, the xml:base attribute is additionally permitted on thedocument root <LegalRuleML> element.


RNC and XSD schemas express this equivalently.

5.16.5. @xml:id

In the original RuleML grammar, the xml:id attribute is allowed on anyelement, although there is a compatibility requirement with an xsi:typeattribute if both appear on a Data element. This causes problems inregard to objective #7 if xml:id appears on a skippable edge, since theinformation is lost upon compactification.

In LegalRuleML, @xml:id, or any other attribute, is not allowed onskippable edges in the lrml namespace through the LegalRuleML RNC schemas.

In order to satisfy objective #7 as fully as possible, while somewhatsacrificing #8, the @xml:id attribute is disallowed on skippable RuleMLedges that are embedded in LegalRuleML documents. This is accomplishedin both RNC and XSD schemas through redefining the imported RuleML RNCmodules prior to conversion to XSD.

A separate issue is in regard to enforcing the uniqueness of xml:idvalues within a document. This is partially accomplished by the XSDschemas. It is not possible to enforce this at all in the RNC schemasdue to some interference from wild card patterns, a known issue (https://github.com/relaxng/jing-trang/issues/178 ). This means the XSDschemas are necessarily more restrictive than the RNC schemas in thisregard.


5.16.6 @key/@keyref

In the original RuleML grammar, the key and keyref attributes areallowed on all elements, including skippable edges. Since key has an IRI(or CURIE) datatype rather than an xsd:ID datatype, it is notincompatible with the xml:id attribute, and so both are allowed to occuron the same element. Like xml:id discussed above, this causes problemsin regard to objective #7. This is only an issue on skippable edge inthe RuleML namespace. In RuleML, what is "skippable" varies somewhatdepending on the expressivity. In some languages, <ruleml:if> and<ruleml:then> edges are skippable, and in others they are not. InLegalRuleML, <ruleml:if> and <ruleml:then> edges, within either Rule orImplies elements, are not skippable.

In order to satisfy objective #7 as fully as possible, while somewhatsacrificing #8, the @key attribute is disallowed on skippable RuleMLedges that are embedded in LegalRuleML documents. This is accomplishedin both RNC and XSD schemas through redefining the imported RuleML RNCmodules prior to conversion to XSD.

As mentioned in item 5.16.5, the xsd:ID datatype for key on LegalRuleMLelements is enforced by the XSD but not the RNC schemas. This imposes auniqueness requirement on the key attribute but only relative to otherattributes with xsd:ID datatype, which does not include @key on RuleMLelements, but does include xml:id attributes.



5.16.7 Document Root Element

The LegalRuleML RNC schemas enforce the requirement that the rootelement is lrml:LegalRuleML. This requirement is not enforce in the XSDschemas, although it could be done with a refactoring of thedefinitions. The challenge is that any element defined at the globallevel in an XSD schema is allowed to be the root element. To restrictthe XSD schema so that only LegalRuleML elements may be the root, allelement definitions would have to be contained within the definition ofthe LegalRuleML element. For enhanced readability of the XSD schemas,this requirement is thus only enforced by the RNC schemas.


5.16.8 Leaf/Branch Type Edges

Edges of this type, which only occur in the LegalRuleML namespace, mayoptionally have a child element and optionally may have attributes. Whenthere is a child, the attributes on the edge are typically meaningless,except for xml:id which serves simply to label the edge. When theconversion is made to the RDF-based representation of the abstractsyntax, the key and keyref attributes are ignored, while an xml:idattribute is honored. Thus if a document is parsed into the abstractsyntax through the XSLT to RDF, and then serialized back into the XMLsyntax, the (meaningless) key and keyref attributes are lost. The idealsolution would be to disallow the @key and @keyref attributes on thistype of edge element.Unfortunately, it is not possible to construct an XSD 1.0 schema thatallows attributes on the edge only when it does not have a child.However this is possible with RNC, and probably is possible withSchematron or XSD 1.1. Also, the removal of such (meaningless)attributes is easily accomplished with XSLT, so a validating XSLTtransformation can be constructed for this constraint.

The RNC schema main drivers (schemas/relaxng/lrml-compact.rnc andschemas/relaxng/lrml-normal.rnc) implement the choice (exclusive or)between attributes and content on edges of Leaf/Branch type. This favorsobjective #3, as well as clarity, at the expense of an additionaldeviation between RNC and XSD schemas. The RNC schema drivers that areused for the conversion to XSD (schemas/relaxng/lrml4xsd-compact.rnc andschemas/relaxng/lrml4xsd-normal.rnc) implement the attributes andcontent of Leaf/Branch-type edges as an inclusive or, so that theconverter does not need to approximate.

There is a validator XSLT(xslt/validators/lrml_validator-leaf-branch.xslt) that strips theseattributes away from a LegalRuleML document, and also uses thexsl:message capability to inform the user when such attributes werepresent in the input.


5.17 Prefix Mapping XSLT Transformation

A number of RuleML and LegalRuleML attributes have values which shouldbe treated as CURIEs. That is, they should be evaluated to IRIsaccording to a prefix mapping, which is defined by the Prefix element.

The XSLT at xslt/normalizer/lrml_normal_canonicalizer.xslt performsCURIE evaluation, in addition to some other modifications, aspreparation for appying the parsing XSLT that produces RDF(schemas/xslt/triplifyMerger-ids.xsl)

This evaluation also applies to the values of @refID within Referenceand LegalReference, which are not constrained to be IRIs or CURIEs. Thechoice of xs:string for the datatype of the prefix mapping (also called@refID) enables this usage, allowing CURIE-like abbreviation to be usedwithin Reference and LegalReference, as illustrated inexamples/draft/ex2.1.10.a-v1-compact.lrml for identifiers of Akoma Ntoso.


5.18
The following constraints are enforced by neither RNC nor XSD schemas:

5.18.1. The prefix mappings and the abbreviations they are to applyshould be such that the evaluation result is conformant to the schema(s).

5.18.2. RuleML collection edges, i.e. those edges with @indexattributes, must have values of @index in agreement with their positionin the node set of sibling collection edges.

5.18.3. IRIs occurring as attribute values, whether originally expressedas an IRI or a CURIE, are required to be fully conformant to RFC3987[18]. In the case of CURIEs, this restriction applies afterexpansion to an IRI according to the prefix.

5.18.4. The value of any keyref attribute must match the value of a keyattribute within the @@@key/keyref closure of the@@@ document.

5.18.5. Each occurrence of @key on LegalRuleML and RuleML elements musthave a unique value (after deletion of the leading colon on values of@key within RuleML elements) within the @@@key/keyref closure of the@@@document.

5.18.6. In the LegalRuleML RDF abstract syntax representation, tripleswhose properties correspond to skippable edges in the concrete syntaxmust not be reified.



5.19 Validating XSLT Transformations

5.19.1 Conformance to the additional constraint 5.18.1 may be checked byapplying the XLST transformation/xslt/normalizer/lrml_prefix_evaluation.xslt , and validating the output.

This transformation is abstract-syntax preserving.

5.19.2 Conformance to the additional constraint 5.18.2 may be checkedthrough the XLST transformation/xslt/validator/lrml_sequential-indexing.xslt .This transformation is abstract-syntax preserving when applied followingthe /xslt/validator/lrml_sequential-indexing.xslt transformation .

Follow-Ups:
- Re: [legalruleml] Revised Section 5
  - From: "Wyner, Adam Zachary" <azwyner@abdn.ac.uk>