[Home] [By Thread] [By Date] [Recent Entries]


Bullard, Claude L (Len) wrote:

> An XML server side process is getting a text that has been pasted in. 
> Within the text are non-printing characters which 
> are apparently handled as UTF-16 and the XML processor 
> on the server replaces with character entities. On 
> the client side, the XML processor says these are not 
> legal characters and is apparently treating them 
> as UTF-8.  What is the best way to handle these?

Well, if you know the encoding of the pasted-in text, you're OK, because 
you can either declare in the XML declaration, or convert it to 
something else, or just turn 'em all into numeric char refs.  There's 
the potential problem of illegal non-XML characters, but you're going to 
have to be prepared to deal with that.

Of course, if you *don't* know the encoding of the pasted-in characters, 
you're basically hosed.  -Tim



Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member