[Home] [By Thread] [By Date] [Recent Entries]
Mike Brown wrote: > I have a question, though. I have seen a reference somewhere saying that > Java characters and strings are UCS-2 encoded, and I saw a reference > somewhere else saying they are UTF-16 encoded. Which is it? Java pretends surrogates don't exist, basically, and will spit out bad UTF-8 for a surrogate pair. So you can call it UCS-2, if you want, but it's really just a broken implementation. -- Schlingt dreifach einen Kreis vom dies! || John Cowan <jcowan@r...> Schliesst euer Aug vor heiliger Schau, || http://www.reutershealth.com Denn er genoss vom Honig-Tau, || http://www.ccil.org/~cowan Und trank die Milch vom Paradies. -- Coleridge (tr. Politzer) *************************************************************************** This is xml-dev, the mailing list for XML developers. To unsubscribe, mailto:majordomo@x...&BODY=unsubscribe%20xml-dev List archives are available at http://xml.org/archives/xml-dev/ ***************************************************************************
|

Cart



