[Home] [By Thread] [By Date] [Recent Entries]

  • To: David Carlisle <davidc@n...>
  • Subject: Re: UTF-8+names
  • From: Tim Bray <tbray@t...>
  • Date: Sun, 19 Oct 2003 11:54:22 -0700
  • Cc: elharo@m..., xml-dev@l...
  • In-reply-to: <200310191630.RAA23060@e3000>
  • References: <200310191630.RAA23060@e3000>
  • User-agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X Mach-O; en-US; rv:1.5) Gecko/20031007

David Carlisle wrote:

>>You're partially right, &lt; is defined by old-fashioned (SGML-based) 
>>HTML but it's *not* defined by XHTML or MathML.  That's because it's 
>>wired into XML so XHTML gets it for free.  So it is *definitely* not 
>>defined in UTF-8+names, and the I-D should say that.
...
> Either way it should be made clarer. I think that for apos and quot
> at least my reading would be more useful as these would then expand to
> the characters which means they are useable in non xml contexts.

Well, there's no doubt that +names is optimized for the needs of XML 
users, in that it defines lots of things like &eacu; but *doesn't* 
define the XML magic 5; this means that &lt; and &amp; and so on go 
through untouched, which is what you need for the purposes of XML users.

For the purposes of XML users, +names really needs to not define &lt; 
and &amp; to be useful for them.  Not defining &apos; and &quot; and 
&gt; is not crucial but it's certainly handy.  I would argue that for 
most non-XML-users, having these things passed through untouched isn't 
really a problem, because normally these are not characters you need to 
escape except in the case that you plan on using the text in XML.

That is to say, I can see non-XML applications finding it handy to be 
able to say "Martin D&uuml;rst said so." but I don't see a common use 
case for "Clearly in this case Ir &lt; Is", unless you're headed for XML.

So I think the current behavior - not replacing the magic five - hits a 
sweet spot. -Tim



Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member