[Home] [By Thread] [By Date] [Recent Entries]

  • From: John Cowan <jcowan@r...>
  • To: "xml-dev@x..." <xml-dev@x...>
  • Date: Mon, 06 Mar 2000 17:08:50 -0500

Mike Brown wrote:

> I have a question, though. I have seen a reference somewhere saying that
> Java characters and strings are UCS-2 encoded, and I saw a reference
> somewhere else saying they are UTF-16 encoded. Which is it?

Java pretends surrogates don't exist, basically, and will spit out
bad UTF-8 for a surrogate pair.  So you can call it UCS-2, if you
want, but it's really just a broken implementation.

-- 

Schlingt dreifach einen Kreis vom dies! || John Cowan <jcowan@r...>
Schliesst euer Aug vor heiliger Schau,  || http://www.reutershealth.com
Denn er genoss vom Honig-Tau,           || http://www.ccil.org/~cowan
Und trank die Milch vom Paradies.            -- Coleridge (tr. Politzer)

***************************************************************************
This is xml-dev, the mailing list for XML developers.
To unsubscribe, mailto:majordomo@x...&BODY=unsubscribe%20xml-dev
List archives are available at http://xml.org/archives/xml-dev/
***************************************************************************

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member