[Home] [By Thread] [By Date] [Recent Entries]

  • From: "Costello, Roger L." <costello@m...>
  • To: "xml-dev@l..." <xml-dev@l...>
  • Date: Fri, 2 Nov 2012 15:27:58 +0000

Liam, hello,

> Another is to use an escaping mechanism - 
> what I call "UCODE", in which an upper-case 
> "U" is followed by hexadecimal and a trailing 
> X to mark the end

To be sure I understand, this text (2 denotes hex two, 3 denotes hex three):

    2Hello World3

is converted to this XML:

   <text>U2XHello WorldU3X</text>

Is that the approach Liam? 

Have you documented this approach anywhere?

/Roger

-----Original Message-----
From: Liam R E Quin [mailto:liam@w...] 
Sent: Friday, November 02, 2012 11:19 AM
To: Costello, Roger L.
Cc: xml-dev@l...
Subject: Re:  Storing illegal XML 1.0 characters in the Unicode Private Use Area

On Wed, 2012-10-31 at 18:04 +0000, Costello, Roger L. wrote:
[...]
> One approach is to move any illegal characters into the Private Use Area: 
Another is to use an escaping mechanism - e.g. what I call "UCODE", in
which an upper-case "U" is followed by hexadecimal and a trailing X to
mark the end; this can safely be used in XML element names for example.

Liam

-- 
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org freenode/#xml
Co-author, 5th edition of "Beginning XML" - Wrox, 2012



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member