[Home] [By Thread] [By Date] [Recent Entries]

  • From: Michael Brennan <Michael_Brennan@a...>
  • To: xml-dev@l...
  • Date: Thu, 26 Jul 2001 16:30:08 -0700

I think I'm answering my own question, here. I just noticed the "UCharacter"
class in this library, which is "designed to be a generic code point
information source that handles surrogate pairs". The docs says it supports
Unicode 3.0.

I think folks looking for an immediate Java-based solution should check this
out. It's open source and uses the X open-source license. 

> From: Michael Brennan [mailto:Michael_Brennan@a...]
> Sent: Thursday, July 26, 2001 2:34 PM
> To: xml-dev@l...
> Subject: RE: Java/Unicode brain damage
> 
> 
> I don't fully understand the issues, here, (I guess I have 
> some studying to
> do) but I'd be interested in hearing from the experts on this 
> regarding
> IBM's ICU4J (http://oss.software.ibm.com/icu4j/). Does this 
> deal better with
> these issues then the standard Java classes? Does the UTF16 
> class help with
> these issues? I notice references to "surrogates" in the API, 
> so it seems
> like it has support for surrogate pairs, but I'm not saavy enough with
> Unicode issues to make a judgement, here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member