[Home] [By Thread] [By Date] [Recent Entries]


>But any file that's in ebcdic encoding will anyway have to have
>an encoding declaration and the parser, if it understands ebcdic at all,
>will have to map everything to unicode, so there's nothing stopping
>ebcdic new-line being used with XML 1.0 as white space in ebcdic encoded
>XML files so long as the parsers map NEL to #10.

Certainly ebcdic-aware implementations can map NEL to #10, but...

>It is unnatural to
>allow #85 as white space in XML as (currently at least) it isn't as far
>as I know an end of line character in any ascii/unicode based system.

... this would mean that you could parse ebcdic files, but not ebcdic
files that had been translated to Unicode in the standard way.

>At least NEL was flagged as being considered in the original
>requirements doc and is a single byte in utf8.

Um, no it isn't.  Only 0x7F are single bytes.  It's a single byte
in Latin-N.

-- Richard

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member