On 1/19/06, dan@xxxxxxxxxxxxx <dan@xxxxxxxxxxxxx> wrote:
> For my application, people have to upload an XML file (their iTunes Music
> Library). After it uploads, I run a transformation on it in PHP5. Many
> transformations are not working and showing errors because of weird
> characters in the source XML. The source and result XML are set to UTF-8.
> Is there a way to skip over a troubled node? I can't imagine that XLST
> doesn't have a way to deal with this.
>
The last time I checked Unicode support in PHP is pretty poor. Are
you sure the XML/XSLT libraries in PHP can actually handle the Unicode
characters? Last time I dealt with them (it was a while ago) I had to
normalize text before it went into the XML parser in PHP. At the time
I looked into using a wrapper around some Java programs but in the end
it was more reliable to filter out the few Unicode characters that did
appear in our input.
Jon Gorman
|