Subject: Re: XSL and international characters
From: Jirka Kosek <jirka@xxxxxxxx>
Date: Wed, 05 Dec 2001 09:19:56 +0100
|
Chris Bayes wrote:
>
> And! What does it say?
> If you want me to stare at another w3 spec for hours you will have to
> wait until tomorrow evening
It states that international characters should be encoded in UTF-8 in
URLs. Byte sequences of UTF-8 are writen as %xx in URLs. This means that
%C5%82 is character which is in UTF-8 represented by sequence of two
bytes C5 82. Hower this doesn't represents Unicode character U+C582, but
U+0142 (latin small letter l with stroke -- ł).
Only problem is that many applications treat URL not as UTF-8 encoded,
but ISO-8859-1 encoded.
Jirka
--
-----------------------------------------------------------------
Jirka Kosek
e-mail: jirka@xxxxxxxx
http://www.kosek.cz
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
|