9 Internationalized Resource Identifiers (IRIs)

Work is currently in progress to produce an RFC defining Internationalized Resource Identifiers (IRIs). Since this work is not yet complete, this section gives a syntactic definition of IRIs for the purposes of this specification. The XML Core Working Group expects to issue an erratum replacing this section with a reference to the RFC when it is published.

Users defining namespaces are advised to restrict namespace names to URIs until the RFC is published and software supporting IRIs is in common use. Implementors are likewise advised not to reject namespace names that violate the drafts in terms of the allowed characters.

For a more general definition and discussion of IRIs see [IRIdraft5] (work in progress).

URI references are restricted to a subset of the ASCII characters; IRI references allow most Unicode characters from #xA0 onwards. Earlier drafts of the IRI RFC (e.g., [IRIdraft3]) also allowed some of the disallowed ASCII characters, but the current draft ([IRIdraft5]) does not.

The additional characters allowed in IRIs by [IRIdraft5] are:
An IRI reference is a string that can be converted to a URI reference by applying the following steps:
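As an illustration only, the conversion from an IRI reference to a URI reference can be sketched in Python. The sketch assumes the conversion consists of encoding each character that is not allowed in a URI reference as UTF-8 and writing each resulting byte as %HH, which is how the IRI drafts describe it. The function name `iri_to_uri` is hypothetical, and treating every non-ASCII character as one to be escaped is a simplification rather than the exact character set defined by [IRIdraft5].

```python
def iri_to_uri(iri: str) -> str:
    """Hypothetical helper: percent-escape non-ASCII characters as UTF-8 bytes.

    Assumption: every character above U+007F is escaped; ASCII characters
    are passed through unchanged, with no validation of the URI syntax.
    """
    parts = []
    for ch in iri:
        if ord(ch) < 0x80:
            # ASCII character: keep as-is.
            parts.append(ch)
        else:
            # Encode the character as UTF-8, then write each byte as %HH.
            parts.append("".join(f"%{b:02X}" for b in ch.encode("utf-8")))
    return "".join(parts)


if __name__ == "__main__":
    print(iri_to_uri("http://example.org/rosé"))
```

Under these assumptions, a namespace name such as `http://example.org/rosé` maps to `http://example.org/ros%C3%A9`, while a name that is already pure ASCII is returned unchanged.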
NOTE: |