Re: XML Max Character Value

To: Alan Gutierrez <alan-xml-dev@e...>
Subject: Re: XML Max Character Value
From: Henri Sivonen <hsivonen@i...>
Date: Sun, 14 Aug 2005 13:15:13 +0300
Cc: XML Developers List <xml-dev@l...>
In-reply-to: <20050813111909.GD4299@m...>
References: <075E1759251CCB49ABF05D8F742AFE270695FDF6@R...> <42FD9944.8010209@o...> <20050813111909.GD4299@m...>

On Aug 13, 2005, at 14:19, Alan Gutierrez wrote:

>     Am I seeing that with Unicode in Java, you need to work with
>     String and not with individual char? That puts a dent in my
>     algorithm, which advanced along the characters in the string.

It depends on what exactly you are doing. A Java char is not a Unicode 
character but a UTF-16 code unit. The values \u0000 and \uFFFF should 
never occur in XML and can be used as sentinels if your algorithm works 
on UTF-16 code units. For the purpose of indexing text, working on 
UTF-16 code units as opposed to working on Unicode characters may well 
be good enough. In that case, a surrogate pair can be treated as two 
adjacent "characters". (Note that even when operating on UTF-32, you 
can have tightly-coupled characters when there is a base character 
followed by combining marks, so working on Unicode characters does not 
buy you inter-character independence.)
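
To make the distinction concrete, here is a minimal sketch (not taken from the original algorithm under discussion) that walks the same string once per UTF-16 code unit and once per Unicode code point. The class name CodeUnitDemo, the helper names, and the choice of U+FFFF as the sentinel are assumptions made for illustration; the only library calls used are String.charAt, String.codePointAt, and Character.charCount from Java 5.

```java
public class CodeUnitDemo {
    // Assumed sentinel: U+FFFF is not a legal XML character,
    // so it cannot collide with text taken from a document.
    static final char SENTINEL = '\uFFFF';

    // Advance one UTF-16 code unit (one Java char) at a time,
    // stopping early if the sentinel is reached.
    static int countCodeUnits(String s) {
        int n = 0;
        for (int i = 0; i < s.length(); i++) {
            if (s.charAt(i) == SENTINEL) {
                break;
            }
            n++;
        }
        return n;
    }

    // Advance one Unicode code point at a time; a surrogate pair
    // is consumed as a single step of two chars.
    static int countCodePoints(String s) {
        int n = 0;
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            i += Character.charCount(cp);
            n++;
        }
        return n;
    }

    public static void main(String[] args) {
        // 'a', U+1D11E (one code point, two code units), 'b'
        String astral = "a\uD834\uDD1Eb";
        System.out.println(countCodeUnits(astral));   // 4
        System.out.println(countCodePoints(astral));  // 3

        // 'e' followed by a combining acute accent: two code points
        // that a reader still perceives as one character.
        String combining = "e\u0301";
        System.out.println(countCodePoints(combining)); // 2

        // The sentinel ends the code-unit walk after "ab".
        System.out.println(countCodeUnits("ab" + SENTINEL + "cd")); // 2
    }
}
```

The combining-mark case shows why moving to code points does not by itself give independent "characters": the count is still two for what is displayed as a single accented letter.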

-- 
Henri Sivonen
hsivonen@i...
http://hsivonen.iki.fi/
