[Home] [By Thread] [By Date] [Recent Entries]


Rick Jelliffe scripsit:

> That makes it clear that control characters are unlike other characters,
> for which Unicode provides "semantics". The only C0 or C1 characters for
> which Unicode provides "semantics" are TAB, CR, LF and NEL.

XML already, however, allows the use of undefined codepoints, which have
far less semantics than the C0 controls.  And a good thing too, or 
Ethiopic and Thaana and Canadian Aboriginal Syllabics would be totally
locked out of XML (they are post-Unicode-2.0) instead of merely
banned in XML names.

-- 
John Cowan           http://www.ccil.org/~cowan              cowan@c...
Please leave your values        |       Check your assumptions.  In fact,
   at the front desk.           |          check your assumptions at the door.
     --sign in Paris hotel      |            --Miles Vorkosigan

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member