[Home] [By Thread] [By Date] [Recent Entries]
Daniel Veillard scripsit:
> Just to put some emphasis to what John Cowan already said, I'm afraid
> of the cost of normalizing on-the-fly, the algorithms I could found
> in the Unicode annexes were just scary (in term of complexity and memory
> requirement) maybe there is simpler lean and cheap normalization
> algorithms (I would like pointers ;-) but definitely that cost is better
> done once at generation time. Apparently normalization checking is
> slightly lighter and as said that check is optional c.f. 2.13 wording.
ICU is, as always, the gold standard for this kind of thing. It has
both normalizing and normalization-checking algorithms.
--
John Cowan <jcowan@r...>
http://www.ccil.org/~cowan http://www.reutershealth.com
Unified Gaelic in Cyrillic script!
http://groups.yahoo.com/group/Celticonlang
|

Cart



