>I've got XML documents, marked up to a DTD, and calling character entity
>sets. When I run through the XSLT processor (xalan) to output another XML
>file I find the entities have been converted to something different, and
>fairly inconsistently.
>
>What I would like to achieve is having “ ü in my input xml, and
>these entities still being untouched in my output. Can anyone advise how I
>achieve this please?
>
>What I'm getting are (“, ü), or (“ and ü), or (“
>and ü), depending on character encoding settings and entity sets used. Am I
>missing something?
>
“ ü are not predefined character entities.
http://www.w3.org/TR/REC-xml#sec-predefined-ent
They apear as literal text strings
'&' 'l' 'd' 'q' 'u' 'o' ';'
and so when searialized to XML the '&' character is replaced by '&' giving
“
if you are making an HTML document and want these character entities you should specify the correct character entity and put.
<xsl:output method="html" version="1.0" encoding="ISO-8859-1">
Edward Middleton
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
|