The characters E2 80 99 are the UTF-8 encoding of the Unicode character
RIGHT SINGLE QUOTATION MARK.
Simply changing the ISO-8859-1 in your XML file to UTF-8 should fix this.
On 11 October 2016 at 21:00, Bridger Dyson-Smith bdysonsmith@xxxxxxxxx <
xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote:
> Hi all,
>
> I'm struggling with a character encoding issue (or a character
> representation issue maybe?): I have input XML that looks like this
>
> input.xml
> <?xml version="1.0" encoding="iso-8859-1"?>
> <documents>
> <document>The reality of the effect of natural ventilation in a
> residential attic cavity has been the topic of many debates and scholarly
> reports since the 1930C"b,b"s.</document>
> </documents>
>
> and I would like to get it to a point where the characters are represented
> properly, i.e.
>
> output.xml
> <?xml version="1.0" encoding="UTF-8"?>
> <documents>
> <document>The reality of the effect of natural ventilation in a
> residential attic cavity has been the topic of many debates and scholarly
> reports since the 1930bs.</document>
> </documents>
>
> Thanks to Liam's help on irc and reading through the list archives, it
> seems like an identity transform should be the right step towards getting
> the representation corrected, but something isn't working (or I have a
> misunderstanding somewhere).
>
> If I apply the following identity transform with Saxon HE 9.6.0.7 in
> oXygen 18:
> <?xml version="1.0" encoding="UTF-8"?>
> <xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
> version="2.0">
> <xsl:output encoding="UTF-8" indent="yes"/>
> <xsl:template match="/"><xsl:copy-of select="/"/></xsl:template>
> </xsl:stylesheet>
>
> I get the following result:
> <?xml version="1.0" encoding="UTF-8"?>
> <documents>
> <document>The reality of the effect of natural ventilation in a
> residential attic cavity has been the topic of many debates and scholarly
> reports since the 1930C"€™s.</document>
> </documents>
>
> Could someone provide some insight into what I've done wrong here? Any
> help would be greatly appreciated.
>
> Best,
> Bridger
>
> XSL-List info and archive <http://www.mulberrytech.com/xsl/xsl-list>
> EasyUnsubscribe <-list/528976> (by
> email <>)
|