Subject: RE: Encoding problem
From: "Michael Kay" <mike@xxxxxxxxxxxx>
Date: Wed, 31 Aug 2005 12:17:21 +0100
That suggests that the encoding is indeed Windows CP-1252: octet x94 in
CP-1252 represents the Unicode character x201D, right double quotation mark.
So if you change the XML declaration of the file to say encoding="cp1252"
(or "windows-1252", the IANA-registered name), and run it through a parser
that recognizes that encoding (not all do, necessarily), then the correct
data should be passed to your application.
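
As a quick check of the mapping, here is a minimal Python sketch. It is
only an illustration: the sample document in `raw` is made up, and it
assumes Python's standard codecs and its bundled ElementTree parser, which
may behave differently from whatever parser your application uses.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: the octet 0x94 reported by the hex editor, inside a
# document whose XML declaration names the encoding explicitly.
raw = b'<?xml version="1.0" encoding="windows-1252"?><q>say \x94</q>'

# Decoding the single octet shows which character CP-1252 assigns to it.
char = b"\x94".decode("cp1252")
print(f"U+{ord(char):04X}")  # U+201D

# For comparison, ISO-8859-1 maps the same octet to a C1 control character.
latin1_char = b"\x94".decode("latin-1")

# A parser that recognizes the declared encoding passes the same
# character through to the application.
text = ET.fromstring(raw).text
print(text == "say \u201d")
```

If the declaration instead said ISO-8859-1, the same octet would reach the
application as the control character x94 rather than as a quotation mark.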
Michael Kay
http://www.saxonica.com/
> -----Original Message-----
> From: amita rathore [mailto:amita_rathore@xxxxxxxxx]
> Sent: 31 August 2005 12:01
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Subject: RE: Encoding problem
>
> I opened the file using UltraEdit. The sequence shows
> as 94 for the trouble character.
>
> --- Michael Kay <mike@xxxxxxxxxxxx> wrote:
>
> > > > and (b)
> > > > the sequence of hex octets used to encode it.
> > > How can I find it ?
> > >
> >
> > Open the file in a hex editor. I use UltraEdit. This
> > is an essential tool
> > for tackling encoding problems: you can't rely on
> > the characters seen on the
> > screen to tell you what the octets in the file are.
> >
> > Michael Kay
> > http://www.saxonica.com/
> >
> >
>
>