[Home] [By Thread] [By Date] [Recent Entries]

  • To: David Carlisle <davidc@n...>, ht@c...
  • Subject: Re: A heavier-weight proposal for character entitydefinition
  • From: James Clark <jjc@j...>
  • Date: Thu, 07 Feb 2002 08:36:27 +0700
  • Cc: xml-dev@l...

Interesting.  Those are compelling use cases but this significantly 
complicates things.  In particular, automatically using entities on output 
becomes much more complicated.  Instead of a simple hash table that maps 
character codes to entities, you have to have a trie.  I also see a 
slippery slope opening up here:

1. single character
2. base character + combining character(s)/other Unicode modifier (MathML)
3. arbitrary sequence of characters (why limit 2? don't want to check 
character types)
4. arbitrary well-formed content (3 allows arbitrary text, and for I18N 
arbitrary text needs elements for eg BIDI and ruby)

Not clear what the right place to draw the line is here.

--On 06 February 2002 15:31 +0000 David Carlisle <davidc@n...> wrote:

>
>  <character name="eacute" code="xC9"/>
>
> A large number of the MathML entities do not map to a single unicode
> code point but two (or in a few cases three). Common cases the second
> character being a combining character (eg the negation / ) or the new
> math variant character VS1 (to be) introduced in Unicode 3.2.
> eg the ISOASMN nlE (not less than) would seem to map to
> U02A7D U00338
> where U02A7D is LESS-THAN OR SLANTED EQUAL TO (unicode 3.2)
> and   U00338 is COMBINING LONG SOLIDUS OVERLAY
>
> so probably code would need to take a sequence of decimal or hex values,
> not just a single one.
>
> David
>
> _____________________________________________________________________
> This message has been checked for all known viruses by Star Internet
> delivered through the MessageLabs Virus Scanning Service. For further
> information visit http://www.star.net.uk/stats.asp or alternatively call
> Star Internet for details on the Virus Scanning Service.
>
> -----------------------------------------------------------------
> The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
> initiative of OASIS <http://www.oasis-open.org>
>
> The list archives are at http://lists.xml.org/archives/xml-dev/
>
> To subscribe or unsubscribe from this list use the subscription
> manager: <http://lists.xml.org/ob/adm.pl>
>
>
>
>

    

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member