
  • From: Eric Bohlman <ebohlman@e...>
  • To: Raffaele Bello <raffaele.bello@e...>, xml-dev@l...
  • Date: Tue, 11 Sep 2001 03:18:32 -0500

9/11/01 2:30:55 AM, Raffaele Bello <raffaele.bello@e...> wrote:

>   when I try to parse an xml document with character like "è" i got the
>   following error:

Your file most likely stores that character in the ISO-8859-1 (Latin-1) encoding, where it is the
single byte 0xE8.

>   org.xml.sax.SAXParseException: Character conversion error: "Malformed
>   UTF-8 char

And you get that error because the parser is expecting UTF-8, and the bytes it finds for that
character are not a valid UTF-8 sequence.
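
To make the mismatch concrete, here is a minimal Python sketch (not the Java parser from the
original report, only an illustration of the bytes involved). It encodes "è" both ways and checks
whether each result is acceptable as UTF-8. The helper name decodes_as_utf8 is made up for this
example.

```python
# "\u00e8" is the character è (LATIN SMALL LETTER E WITH GRAVE).
ch = "\u00e8"

latin1_bytes = ch.encode("iso-8859-1")  # one byte: 0xE8
utf8_bytes = ch.encode("utf-8")         # two bytes: 0xC3 0xA8


def decodes_as_utf8(data: bytes) -> bool:
    """Return True if data is a well-formed UTF-8 byte sequence."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(latin1_bytes.hex(), decodes_as_utf8(latin1_bytes))
print(utf8_bytes.hex(), decodes_as_utf8(utf8_bytes))
```

In UTF-8 the byte 0xE8 can only start a three-byte sequence, so a lone 0xE8 followed by ordinary
text is rejected, which is roughly what the "Malformed UTF-8 char" message is reporting.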

>   The strings in the xml document are in Italian, maybe I should use a
>   different encoding?
>
>
>   I'd use
>
>   <?xml version = "1.0" encoding = "UTF-8"?>

You're telling the parser that your characters are encoded in UTF-8, but in reality they're encoded
in ISO-8859-1.  The two encodings agree on the ASCII subset but use different byte sequences for
every character outside it.  If your documents are encoded in anything other than UTF-8 (or UTF-16),
you need to specify the correct encoding in your XML declaration, for example
encoding = "ISO-8859-1".
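
As a rough illustration of the fix, the Python sketch below parses the same ISO-8859-1 bytes twice,
once under a UTF-8 declaration and once under an ISO-8859-1 declaration. It uses Python's standard
xml.etree.ElementTree rather than the Java SAX parser from the report, and the element name and
the try_parse helper are invented for the example.

```python
import xml.etree.ElementTree as ET

# Element content "città" stored as ISO-8859-1 bytes ("\u00e0" is à, byte 0xE0).
body = "<nome>citt\u00e0</nome>".encode("iso-8859-1")

# Same bytes, two different declarations.
wrong = b'<?xml version="1.0" encoding="UTF-8"?>' + body
right = b'<?xml version="1.0" encoding="ISO-8859-1"?>' + body


def try_parse(doc: bytes):
    """Return the root element's text, or None if the document does not parse."""
    try:
        return ET.fromstring(doc).text
    except ET.ParseError:
        return None


print(try_parse(wrong))  # declaration does not match the bytes
print(try_parse(right))  # declaration matches the bytes
```

The other way to resolve the mismatch is to keep the UTF-8 declaration and re-save the file as
UTF-8 in your editor, so that the bytes match what the declaration promises.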


