[Home] [By Thread] [By Date] [Recent Entries]


Rich Salz wrote:

> He goes a bit overboard, but Mark Pilgrim makes some good points:
>    http://diveintomark.org/archives/2004/01/14/thought_experiment
> (What, Mark going overboard?  I'm shocked I say, shocked.)  Don't lose 
> his point:
> it's not necessarily invalid markup, but different character 
> encodings, too.

The problem is an architectural one: if you have feeds coming in from 
sources
out of your control, you need to have a fallback mechanism for when
feeds come in incorrectly, whether incorrect beause of bogus encodings 
or crap
markup.

If your application has not been coded for errors, of course it will be 
fragile.

People who think documents should allow pieces of different encodings are
thinking of the English case where characters in different encodings are 
probably
decorations or prissiness: but if I have half of a generated HTML page 
in Japanese
Shift JIS and half in Japanese EUC-J somehow, which one should be rendered
correctly? Either way the page is crap, and the users will complain.

Underneath Mark's issue is the same old one: that there are serious systems
and casual systems, and the disciplines required for one is burdensome for
the other.

Cheers
Rick Jelliffe


Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member