[Home] [By Thread] [By Date] [Recent Entries]


Consider this document, encoded in UTF-8 with no BOM:

<?xml version="1.0"?>
<ä/>

Is there a safe way for a non-XML-aware text editor to find out that this
file is using UTF-8?

There are still a lot of people over here who likes to use ISO 8859-1,
because they have the conception that 'ä' is written 'ä' in UTF-8. I was
about to tell one that it's just his editor that's broken, but then I came
to think about this: maybe there isn't a good way for a general text editor
to know about the UTF-8 encoding? Maybe the EF BB BF signature should have
been made mandatory?

I guess there's something I overlook. Can someone explain?

Gustaf

WordPad.lnk


Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member