RE: [xsl] How to read the encoding of an XML document

Cart

XML Editor - Download a Free Trial >

See What's New >

Buy Now >

[Home] [By Thread] [By Date] [Recent Entries]

Subject: RE: How to read the encoding of an XML document
From: "Diamond, Jason" <Jason.Diamond@xxxxxxx>
Date: Thu, 25 Oct 2001 18:03:44 -0600

> > while UTF-16 uses 2 bytes for most characters.
> since it's gone midnight and I no longer need to be helpful in this
> thread I could query the definition of most here, xFFFF not being most
> of x10FFFF by some definitions of most. (Although depending whether you
> view an unallocated unicode slot as a character, the numbers might be
> different) 

If the Unicode scalar value is less that 0xFFFF it only requires two bytes
using UTF-16 to encode but if it's greater than 0xFFFF then UTF-16
represents that value using a "surrogate pair" which is four bytes total in
length. Since most Unicode characters have a value that's less than 0xFFFF,
most characters will only require two bytes to encode.

UTF-16 can encode all characters in the 0 to 0x10FFFF range. And so can
UTF-8 and UTF-32. UCS-2, however, cannot encode characters above 0xFFFF.

Jason.

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list

Current Thread
RE: How to read the encoding of an XML document Diamond, Jason - Thu, 25 Oct 2001 20:14:37 -0400 (EDT) <= David Carlisle - Thu, 25 Oct 2001 20:47:35 -0400 (EDT) <Possible follow-ups> Diamond, Jason - Thu, 25 Oct 2001 20:58:34 -0400 (EDT) David Carlisle - Fri, 26 Oct 2001 05:26:49 -0400 (EDT) Joerg Pietschmann - Fri, 26 Oct 2001 03:42:17 -0400 (EDT)

<- Previous	Index	Next ->
RE: Can't pass parameters acr, Joerg Pietschmann	Thread	Re: How to read the encoding , David Carlisle
Re: How to read the encoding , David Carlisle	Date	RE: use of starts-with(), Chris Bayes
	Month

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >