OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

entity-resolution message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [Elist Home]


Subject: Just the facts about public ids, URNs, and URIs


The following table shows which ASCII characters are legal as what
in XML public ids, RFC 2141 URNs, and RFC 2396 URIs.  See below
for explanations of terms.  Use a fixed font.

Character(s) 
	Pubid	URN		URI		Status
============ 
	=====	===		===		======

LATIN CAPITAL LETTER ?	yes	upper		lowalpha	NORM
LATIN SMALL LETTER ?	yes	lower		upalpha		NORM
DIGIT *			yes	number		digit 
	NORM
HYPHEN-MINUS 
	yes	other		mark		NORM
LEFT PARENTHESIS	yes	other		mark 
	NORM
RIGHT PARENTHESIS	yes	other		mark		NORM
FULL STOP		yes	other		mark 
	NORM
EXCLAMATION MARK	yes	other		mark		NORM
ASTERISK 
	yes	other		mark		NORM
LOW LINE		yes	other 
	mark		NORM
PLUS SIGN		yes	other		reserved 
AVAIL
COMMA 
		yes	other		reserved	AVAIL
COLON 
		yes	other		reserved	AVAIL
EQUALS SIGN		yes	other		reserved	AVAIL
SEMICOLON 
	yes	other		reserved	AVAIL
COMMERCIAL AT		yes	other		reserved	AVAIL
DOLLAR SIGN		yes	other		reserved	AVAIL
QUESTION MARK		yes	reserved	reserved	ENCODE
SOLIDUS 
		yes	reserved	reserved	ENCODE
NUMBER SIGN		yes	reserved	delims		ENCODE
PERCENT SIGN		yes	reserved	delims		ENCODE
SPACE 
		yes	excluded	space		ENCODE
APOSTROPHE 
	yes	excluded	mark		ENCODE
AMPERSAND 
	no	excluded	reserved	AVAIL
TILDE 
		no	excluded	mark		NULL
REVERSE SOLIDUS		no	excluded	delims		NULL
QUOTATION MARK		no	excluded	delims		NULL
LESS-THAN SIGN		no	excluded	delims		NULL
GREATER-THAN SIGN	no	excluded	delims		NULL
LEFT SQUARE BRACKET	no	excluded	unwise		NULL
RIGHT SQUARE BRACKET	no	excluded	unwise		NULL
CIRCUMFLEX		no	excluded	unwise		NULL
GRAVE ACCENT		no	excluded	unwise		NULL
LEFT CURLY BRACE	no	excluded	unwise		NULL
VERTICAL LINE		no	excluded	unwise		NULL
RIGHT CURLY BRACE	no	excluded	unwise		NULL


URN upper, lower, number, other: MAY be used without %-encoding.
URN reserved: SHOULD NOT be used without %-encoding.
URN excluded: MUST NOT be used without %-encoding.

URI lowalpha, upalpha, digits, mark: MAY be used without %-encoding;
	%-encoding MUST NOT affect semantics.
URI reserved: MAY be used without %-encoding;
	%-encoding MAY affect semantics.
URI space, delims, unwise: MUST NOT be used without %-encoding.

Status NORM: No encoding needed, can't be used as syntax.
Status ENCODE: MUST be encoded (%-encoded or privately).
Status AVAIL: Available for use as syntax character
	if literal use is %-encoded (AMPERSAND has no literal use).
Status NULL: Not usable in pubids, included for completeness.

-- 
There is / one art             || John Cowan <jcowan@reutershealth.com>
no more / no less              || http://www.reutershealth.com
to do / all things             || http://www.ccil.org/~cowan
with art- / lessness           \\ -- Piet Hein



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [Elist Home]


Powered by eList eXpress LLC