[Home] [By Thread] [By Date] [Recent Entries]


Your data is not UTF-8.   It is probably the Windows Latin 1 code page, a.k.a "ANSI" a.k.a CP-1252.  

The SAX parser is correct to complain. Correct the encoding declaration to "WINDOWS-1252"
which is the preferred name on the Internet.  


Cheers
Rick Jelliffe


----- Original Message ----- 
From: "Malligeswari N" <malliga@d...>
To: <xml-dev@l...>
Sent: Tuesday, June 03, 2003 5:02 PM
Subject:  Urgent help in XML parser


Hi All,
     I'm using SAX parser. My xml document has encoding style : 'UTF-8'.

     My inputdata looks like this -
                    <DATA_DESCRIPTION><![CDATA[ TODAY'S
DATE ]]></DATA_DESCRIPTION>

    My parser throws a errors while parsing this particular character " '
" - apos.
                " java.io.UTFDataFormatException: invalid byte 1 of 1-byte
UTF-8 sequence (0x92)
             void
org.apache.xerces.parsers.StandardParserConfiguration.parse(org.apache.xerce
s.xni.parser.XMLInputSource)
             void
org.apache.xerces.parsers.XMLParser.parse(org.apache.xerces.xni.parser.XMLIn
putSource)

             void
org.apache.xerces.parsers.AbstractSAXParser.parse(org.xml.sax.InputSource)
..."

   Pl. let me know how to solve this...

Thanks and Regards,

Malligen.





Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member