[Home] [By Thread] [By Date] [Recent Entries]


Daniel Veillard scripsit:

>   Just to put some emphasis to what John Cowan already said, I'm afraid
> of the cost of normalizing on-the-fly, the algorithms I could found
> in the Unicode annexes were just scary (in term of complexity and memory
> requirement) maybe there is simpler lean and cheap normalization 
> algorithms (I would like pointers ;-) but definitely that cost is better
> done once at generation time. Apparently normalization checking is
> slightly lighter and as said that check is optional c.f. 2.13 wording.

ICU is, as always, the gold standard for this kind of thing.  It has
both normalizing and normalization-checking algorithms.

-- 
John Cowan                              <jcowan@r...>
http://www.ccil.org/~cowan              http://www.reutershealth.com
Unified Gaelic in Cyrillic script!
        http://groups.yahoo.com/group/Celticonlang

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member