


Subject: [OASIS Issue Tracker] (ODATA-1599) Clarify support for supplementary plane characters in OData Identifiers


Michael Pizzo created ODATA-1599:
------------------------------------

             Summary: Clarify support for supplementary plane characters in OData Identifiers
                 Key: ODATA-1599
                 URL: https://issues.oasis-open.org/browse/ODATA-1599
             Project: OASIS Open Data Protocol (OData) TC
          Issue Type: New Feature
          Components: CSDL JSON, CSDL XML
    Affects Versions: V4.01_OS
            Reporter: Michael Pizzo
             Fix For: V4.02


In [OData Common Schema Definition Language (CSDL) XML Representation Version 4.01 (oasis-open.org)|http://docs.oasis-open.org/odata/odata-csdl-xml/v4.01/odata-csdl-xml-v4.01.html#sec_SimpleIdentifier] we define the rules for a legal identifier (for instance, a property name) in OData as follows:
{quote}A simple identifier is a Unicode character sequence with the following restrictions:
 * It consists of at least one and at most 128 Unicode characters (code points).
 * The first character MUST be the underscore character (U+005F) or any character in the Unicode category "Letter (L)" or "Letter number (Nl)".
 * The remaining characters MUST be the underscore character (U+005F) or any character in the Unicode category "Letter (L)", "Letter number (Nl)", "Decimal number (Nd)", "Non-spacing mark (Mn)", "Combining spacing mark (Mc)", "Connector punctuation (Pc)", and "Other, format (Cf)".

Non-normatively speaking it starts with a letter or underscore, followed by at most 127 letters, underscores or digits.
{quote}
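As a rough illustration of the quoted rules, here is a minimal Python sketch. The function name `is_simple_identifier` and the two category sets are hypothetical names chosen for this example, not part of any OData library. It assumes the rules are applied per code point (Python strings are sequences of code points), and the category data comes from whatever Unicode version the Python interpreter bundles.

```python
import unicodedata

# General categories allowed for the first character: Letter (L) and Letter number (Nl).
FIRST_CATEGORIES = {"Lu", "Ll", "Lt", "Lm", "Lo", "Nl"}
# Additional general categories allowed for the remaining characters.
REST_CATEGORIES = FIRST_CATEGORIES | {"Nd", "Mn", "Mc", "Pc", "Cf"}


def is_simple_identifier(name: str) -> bool:
    """Check a string against the quoted SimpleIdentifier rules, per code point."""
    # len() counts code points in Python, so a supplementary plane
    # character counts as one character here, not two.
    if not 1 <= len(name) <= 128:
        return False
    first, rest = name[0], name[1:]
    if first != "_" and unicodedata.category(first) not in FIRST_CATEGORIES:
        return False
    return all(
        ch == "_" or unicodedata.category(ch) in REST_CATEGORIES for ch in rest
    )
```

Read this way, a supplementary plane letter such as U+10400 (general category Lu) passes the check, because it is a single code point in category L. An implementation that inspects UTF-16 code units instead could reach a different answer.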
The Unicode Category L is defined as Lu | Ll | Lt | Lm | Lo.

Where Lo is "Other Letter".

Question: did we intend to include surrogates (see [The Unicode Standard, Version 15.0|https://www.unicode.org/versions/Unicode15.0.0/ch03.pdf#G2630]) or, more properly, supplementary plane characters, as valid characters in a property name?

The rules for valid property names were largely taken from programming languages in order to facilitate mapping between OData types and language objects. C#, for example, does not allow surrogates in property names.
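To make the code point versus code unit distinction concrete, the following Python sketch shows how a supplementary plane letter looks when viewed as UTF-16 code units. The helper names `utf16_code_units` and `code_unit_categories` are hypothetical. This does not model any particular compiler; it only illustrates why a check applied to individual UTF-16 code units would see two surrogates (general category Cs) rather than one letter.

```python
import struct
import unicodedata


def utf16_code_units(text: str) -> list:
    """Return the UTF-16 code units of text as a list of integers."""
    data = text.encode("utf-16-be")
    return list(struct.unpack(f">{len(data) // 2}H", data))


def code_unit_categories(text: str) -> list:
    """Return the general category of each UTF-16 code unit, taken in isolation."""
    return [unicodedata.category(chr(unit)) for unit in utf16_code_units(text)]


# U+10400 DESERET CAPITAL LETTER LONG I lies outside the Basic Multilingual Plane.
sample = "\U00010400"
as_code_point = unicodedata.category(sample)   # category of the single code point
as_code_units = code_unit_categories(sample)   # categories of its two surrogates
```

Here `as_code_point` is a Letter category, while every entry of `as_code_units` is Cs, which is not in any of the categories the identifier rules allow.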

Note that ODATA-1348 tries to distinguish between characters and code points with regard to things like max string length, but this doesn't address character validity within an identifier.



--
This message was sent by Atlassian Jira
(v8.3.3#803004)

