[Home] [By Thread] [By Date] [Recent Entries]

  • From: Andrew Welch <andrew.j.welch@g...>
  • To: Vincent De Groote <list.encelade@g...>
  • Date: Thu, 12 Mar 2009 22:51:53 +0000


> Does the string functions (string-length, substring, ... ) work with Unicode
> character units, or the implementation code point units ?
> If they count as implementation code point units, does it have a fixed type
> (like utf16) ?  Or is it implementation dependant ?


afaiu, utf-16 is still variable length, as some characters are "surrogate pairs", so 2 lots of 16 bits.

Either way, the string functions count characters, not bytes, so you don't have to worry about the underlying encoding.




--
Andrew Welch
http://andrewjwelch.com
Kernow: http://kernowforsaxon.sf.net/



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member