Table of contentsAppendices |
3.1 URI Reference Encoding and EscapingURI Reference Encoding and EscapingThe set of characters allowed in xml:base attributes is the same as for XML, namely [Unicode]. However, some Unicode characters are disallowed from URI references, and thus processors Must, May, etc. encode and escape these characters to obtain a valid URI reference from the attribute value. The disallowed characters include all non-ASCII characters, plus the excluded characters listed in Section 2.4 of [RFC2396], except for the number sign (#) and percent sign (%) characters and the square bracket characters re-allowed in [RFC2732]. Disallowed characters Must, May, etc. be escaped as follows:
|