Re: Specifying a Unicode subset

Cart

XML Editor - Download a Free Trial >

See What's New >

Buy Now >

[Home] [By Thread] [By Date] [Recent Entries]

To: gustaf.liljegren@x... (Gustaf Liljegren)
Subject: Re: Specifying a Unicode subset
From: John Cowan <jcowan@r...>
Date: Mon, 21 Oct 2002 12:24:41 -0400 (EDT)
Cc: xml-dev@l...
In-reply-to: <3.0.6.32.20021021180358.0098f730@m...> from "Gustaf Liljegren" at Oct 21, 2002 06:03:58 PM

Gustaf Liljegren scripsit:

> With XML 1.1 (here's my point), there's a proposal to include more
> characters from Unicode in XML. 

In fact, XML 1.1 allows *fewer* characters than XML 1.0, but not ones that
we expect anyone to have used: the characters #x7F-#x9F, with the exception
of #x85.  

> However, some want more characters in XML, while others don't want them.
> Perhaps we can allow for both by letting documents declare their own subset
> of Unicode?

Unicode is rather resistant to the idea of declared subsets.  The conformance
requirement is essentially "Don't corrupt what you don't understand";
explicit transformations are fine, but in general if a particular process
cannot handle a character, it should pass it through unchanged.  (Rendering
is obviously an exception.)

-- 
Business before pleasure, if not too bloomering long before.
        --Nicholas van Rijn
                John Cowan <jcowan@r...>
                        http://www.ccil.org/~cowan  http://www.reutershealth.com

References:
- Specifying a Unicode subset
  - From: Gustaf Liljegren <gustaf.liljegren@x...>

Prev by Date: Re: Specifying a Unicode subset
Next by Date: Re: Specifying a Unicode subset
Previous by thread: Re: Specifying a Unicode subset
Next by thread: Re: Specifying a Unicode subset
Index(es):
- Date
- Thread

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >