OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

xri-editors message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: RE: [xri-editors] Xref escaping rules


Here's the proposed text around this issue if you're interested.

XRIs MUST use escaped encoding as described in [RFC2396]. XRIs also have
additional escaping rules described below.

XRIs can contain nested URIs as cross-references (see section [TODO]). These
nested URIs can contain characters that, if unescaped, would cause
misinterpretation when parsing the enclosing XRI. Consider the following
XRI.

xri:@example/(http://www.example.com?id=1)

The generic parsing algorithm described in [RFC2396] would separate the
above XRI into the following components

scheme = xri
authority = <undefined>
path = @example/(http://www.example.com?
query = id=1)

The desired separation is

scheme = xri
authority = <undefined>
path = @example/(http://www.example.com?id=1)
query = <undefined>

To avoid this type of misinterpretation, a URI (including an XRI) must be
escaped before it can be used as a cross-reference. In particular, it  MUST
be converted such that the percent "%" character is escaped as "%25", the
question mark "?" character is escaped as "%3F" and the pound sign "#" is
escaped as "%28".

The example above, then, would be correctly expressed as

xri:@example/(http://www.example.com%3Fid=1)

Cross-references may be nested to any depth. The above rule is applied to
each nesting level. For example, if the above XRI was used as a
cross-reference, the percent sign "%" would be escaped to "%25".

xri:@example1/( xri:@example/(http://www.example.com%253Fid=1))

Furthermore, parentheses "(" and ")" MUST be escaped if they appear in a URI
but do not delimit an XRI cross-reference.  If an XRI cross-reference is
embedded within an enclosing cross-reference, the parentheses delimiting the
inner cross-reference MAY be escaped, but this is NOT RECOMMENDED because it
complicates comparison. For example

xri:@example1/( xri:@example/(http://www.example.com%253Fid=1))

is preferred to

xri:@example1/( xri:@example/%28http://www.example.com%253Fid=1%29)

although both are legal and equivalent.

Before a URI may be used as a cross-reference, then, the following steps
MUST be performed in order:

1.	Escape all percent "%" characters as "%25".
2.	Escape all parentheses "(" and ")" that do not delimit XRI
cross-references as "%28" and "%29", respectively.
3.	Escape all pound "#" characters as "%23".
4.	Escape all question mark "?" characters as "%3F".

Note that the URI is assumed to be in an "escaped" form appropriate to its
scheme before the above rules are applied.

Before a cross-reference is evaluated it MUST first be unescaped. If a
cross-reference is contained in an enclosing cross-reference, the enclosing
cross-reference MUST be unescaped before the inner cross-reference is
evaluated and unescaped.

-----Original Message-----
From: Drummond Reed [mailto:drummond.reed@onename.com] 
Sent: Monday, June 30, 2003 2:30 PM
To: Dave McAlpin; Wachob, Gabe; xri-editors@lists.oasis-open.org
Subject: RE: [xri-editors] Xref escaping rules

+2 (except that it's %28 not #28).

=Drummond 

-----Original Message-----
From: Dave McAlpin [mailto:dave.mcalpin@epokinc.com]
Sent: Monday, June 30, 2003 1:41 PM
To: 'Wachob, Gabe'; Drummond Reed; xri-editors@lists.oasis-open.org
Subject: RE: [xri-editors] Xref escaping rules

Sorry, I misunderstood the question. I'm allowing both in the normative
text
and saying the second is preferred.

-----Original Message-----
From: Wachob, Gabe [mailto:gwachob@visa.com]
Sent: Monday, June 30, 2003 1:35 PM
To: 'Dave McAlpin'; Wachob, Gabe; 'Drummond Reed';
xri-editors@lists.oasis-open.org
Subject: RE: [xri-editors] Xref escaping rules

Really?

So you'd prefer:

xri://foo.bar/(+Gabe/#28+stuff#29)

over

xri://foo.bar/(+Gabe/(+stuff))

I don't see a reason why we need to do escaping for the () on XRI values
as
cross references..

        -Gabe

> -----Original Message-----
> From: Dave McAlpin [mailto:dave.mcalpin@epokinc.com]
> Sent: Monday, June 30, 2003 12:48 PM
> To: 'Wachob, Gabe'; 'Drummond Reed'; xri-editors@lists.oasis-open.org
> Subject: RE: [xri-editors] Xref escaping rules
>
>
> I think these rules apply to cross-references in general,
> both general URIs
> and XRIs.
>
> -----Original Message-----
> From: Wachob, Gabe [mailto:gwachob@visa.com]
> Sent: Monday, June 30, 2003 12:05 PM
> To: 'Dave McAlpin'; Wachob, Gabe; 'Drummond Reed';
> xri-editors@lists.oasis-open.org
> Subject: RE: [xri-editors] Xref escaping rules
>
> Dave-
>       This seems straightforward and I don't know why it
> wasn't the first
> proposal on the table. I can't seem to figure out a problem
> with it, so +1
> unless someone can discover an issue with it.
>
>       BTW, the way I read it is that this escaping is only
> neccesary for
> URIs as xrefs, not XRI values as xrefs...
>
>       -Gabe
>
> > -----Original Message-----
> > From: Dave McAlpin [mailto:dave.mcalpin@epokinc.com]
> > Sent: Monday, June 30, 2003 10:08 AM
> > To: 'Wachob, Gabe'; 'Drummond Reed';
> xri-editors@lists.oasis-open.org
> > Subject: RE: [xri-editors] Xref escaping rules
> >
> >
> > I assumed we would escape iteratively, once for each level
> of nesting
> >
> > 1) Embed http://foo.com?id=21#bar in xri:@foo/baz as a
> cross-reference
> >
> > a) Escape cross-reference to get
> > http://foo.com%3Fid=21%23bar
> >
> > b) Add as cross-reference
> > xri:@foo/baz/(http://foo.com%3Fid=21%23bar)
> >
> > When the cross-reference is extracted, it's unescaped to
> >
> > http://foo.com?id=21#bar
> >
> > 2) Embed http://foo.com?id=21-%28widget#bar in xri:@foo/baz as a
> > cross-refrence
> >
> > a) Escape the cross-reference to get
> > http://foo.com%3Fid=21-%2528widget%23bar
> >
> > b) Append as cross-reference
> > xri:@foo/baz/(http://foo.com%3Fid=21-%2528widget%23bar)
> >
> > When the cross-reference is extracted, it's unescaped to
> >
> > http://foo.com?id=21-%28widget#bar
> >
> > 3) A complex nested example. Embed http://foo.com?id=21-%28widget in
> > xri:+foo.bar.<xref>, which is in turn embedded in
> > xri://foo.bar/<xref>/baz
> >
> > a) First escape the innermost cross-reference to get
> > http://foo.com%3Fid=21-%2528widget
> >
> > b) Add as cross-reference
> > xri:+foo.bar.(http://foo.com%3Fid=21-%2528widget)
> >
> > c) Escape the cross-reference to get
> > xri:+foo.bar.(http://foo.com%253Fid=21-%252528widget)
> >
> > d) Add as cross-reference
> > xri://foo.bar/(xri:+foo.bar.(http://foo.com%253Fid=21-%252528w
> > idget)))/baz
> >
> > When the XRI is evaluated, the outermost cross-reference is
> extracted
> >
> > xri:+foo.bar.(http://foo.com%253Fid=21-%252528widget))
> >
> > And unescaped to give
> >
> > xri:+foo.bar.(http://foo.com%3Fid=21-%2528widget))
> >
> > The next level cross-reference is extracted
> >
> > http://foo.com%3Fid=21-%2528widget
> >
> > and unescaped to give
> >
> > http://foo.com?id=21-%28widget
> >
> > Dave
> >
> > -----Original Message-----
> > From: Wachob, Gabe [mailto:gwachob@visa.com]
> > Sent: Monday, June 30, 2003 9:19 AM
> > To: 'Drummond Reed'; xri-editors@lists.oasis-open.org
> > Subject: RE: [xri-editors] Xref escaping rules
> >
> > Drummond-
> >     This seems pretty reasonable.
> >
> >     Have you considered the case where you have two levels of cross
> > referencing? That is, a cross reference which contains an XRI
> > value that
> > itself has a cross reference?
> >
> > xri://foo.bar/(+foo.bar.(http://www.foo.com#3ffoo))/baz
> >
> > vs
> >
> > xri://foo.bar/(+foo.bar.(http://www.foo.com(#3f)foo))/baz
> >
> > vs
> >
> > xri://foo.bar/(+foo.bar.(http://www.foo.com((#3f))foo))/baz
> >
> > I think you need to amend the rules about escaping/unescaping
> > to account for
> > nesting..
> >
> >     -Gabe
> >
> >
> > > -----Original Message-----
> > > From: Drummond Reed [mailto:drummond.reed@onename.com]
> > > Sent: Sunday, June 29, 2003 1:15 PM
> > > To: xri-editors@lists.oasis-open.org
> > > Subject: [xri-editors] Xref escaping rules
> > >
> > >
> > > Here's the escaping proposal that I brought up on the
> last TC call.
> > > 
> > > The issue is that an XRI or URI is used as a cross-reference
> > > may contain
> > > characters that must be escaped in order to conform with 2396.
> > > Specifically:
> > > 
> > > * "?" may not appear until the start of the one-and-only
> > query segment
> > > allowed by 2396.
> > > * The same is true with "#" for fragments
> > > * Lastly, an UNMATCHED opening or closing paren must be
> > > escaped or else
> > > the cross-reference will not parse correctly. (Note that
> > > MATCHED opening
> > > and closing parens inside do not need escaping because they
> > represent
> > > either a second-level nested cross-reference or a parenthetically
> > > enclosed string in a native URI but either way will not be
> > ambiguous.)
> > > 
> > > However we can't use just 2396 escaping rules (i.e., "%xx"
> > where xx is
> > > the hex value of the character) for these four characters
> > > because it if
> > > the embedded XRI or URI contained those same escape sequences
> > > natively,
> > > it would be ambiguous which to unescape and not to
> unescape when the
> > > cross-reference is extracted. For example, if a URI already
> > contained
> > > the escape sequence %3F (the "?" character) before it was
> > > turned into a
> > > cross-reference, how would the parser know NOT to unescape it when
> > > extracting the cross-reference?
> > > 
> > > The proposed solution is to apply two special escaping
> rules to any
> > > URI/XRI string being embedded as a cross-reference. They must
> > > be applied
> > > in the following order:
> > > 
> > > 1) Parse the string to find any instance of "%3F", "%23",
> "%28", or
> > > "%29" and enclose these in parens. Example: "(%3F)".
> > > 2) Escape any "?", "#", or UNMATCHED paren as per 2396 escaping.
> > > Example: a URI that contained a query would have the "?"
> turned into
> > > %3F.
> > > 
> > > To extract a cross-reference, apply the same two rules in
> > > reverse, i.e.:
> > > 
> > > 1) From the extracted cross-reference, escape any instance of
> > > %3F, %23,
> > > %28, or %29 EXCEPT those enclosed in parens.
> > > 2) For any instance of "%3F", "%23", "%28", or "%29" enclosed
> > > in parens,
> > > remove the parens.
> > > 
> > > Example:
> > > 
> > > Embed: URI "http://foo.com/?id=21#bar" in XRI "xri:@foo/baz/"
> > > Result: "xri:@foo/baz/(http://foo.com/%3Fid=21%23bar)"
> > > 
> > > 
> > > Embed: URI "http://foo.com/?id=21-%28widget#bar" in XRI
> > > "xri:@foo/baz/"
> > > Result: "xri:@foo/baz/(http://foo.com/%3Fid=21-(%28)widget%23bar)"
> > > 
> > > Does this work for everyone? Any other proposals?
> > > 
> > > =Drummond
> > > 
> > > 
> > > 
> > > 
> > >
> > >
> >
> ---------------------------------------------------------------------
> > > To unsubscribe, e-mail:
> xri-editors-unsubscribe@lists.oasis-open.org
> > > For additional commands, e-mail:
> > xri-editors-help@lists.oasis-open.org
> > >
> >
> >
> ---------------------------------------------------------------------
> > To unsubscribe, e-mail: xri-editors-unsubscribe@lists.oasis-open.org
> > For additional commands, e-mail:
> xri-editors-help@lists.oasis-open.org
> >
> >
> >
> >
> >
> ---------------------------------------------------------------------
> > To unsubscribe, e-mail: xri-editors-unsubscribe@lists.oasis-open.org
> > For additional commands, e-mail:
> xri-editors-help@lists.oasis-open.org
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: xri-editors-unsubscribe@lists.oasis-open.org
> For additional commands, e-mail: xri-editors-help@lists.oasis-open.org
>
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: xri-editors-unsubscribe@lists.oasis-open.org
For additional commands, e-mail: xri-editors-help@lists.oasis-open.org






[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]