[Home] [By Thread] [By Date] [Recent Entries]


At 1:49 AM -0700 4/25/04, Jeff Rafter wrote:

>Because I am currently not doing a transformation as described in the note I
>decided to simply discard additional "x" characters. I still have a question
>about the hexadecimal number though. Because the language I am working in
>has a char limit of U+FFFF, I decided to limit my the input to the same
>number. Again, I have the potential of multiple leading "0" characters, but
>I again discard them. Does this sound reasonable?

I'm certainly no Relax expert, but on the face of it is does *NOT* 
sound reasonable. In general XML and Unicode processing, one *MUST* 
handle characters with code points beyond U+FFFF. They are not 
optional.  This is true even if your programming language (Java 
perhaps?) has inadequate support for them.

Just possibly, if this only applies to characters in XML names, not 
content or attribute values, and only XML 1.0 is supported (Not XML 
1.1) then this may be reasonable. Otherwise it's clearly not.
-- 

   Elliotte Rusty Harold
   elharo@m...
   Effective XML (Addison-Wesley, 2003)
   http://www.cafeconleche.org/books/effectivexml
   http://www.amazon.com/exec/obidos/ISBN%3D0321150406/ref%3Dnosim/cafeaulaitA

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member