Subject: Re: XSLT script to report Unicode characters and code blocks in file?
From: Colin Paul Adams <colin@xxxxxxxxxxxxxxxxxx>
Date: Fri, 30 May 2008 13:17:59 +0100
|
>>>>> "Michael" == Michael Kay <mike@xxxxxxxxxxxx> writes:
Michael> Incidentally, the current version of blocks.txt does not
Michael> exactly match the names of the blocks as defined in
Michael> schema (and XPath) regular expressions, for example
Michael> 0370..03FF was once "Greek" but is now "Greek and
Michael> Coptic". The Schema WG is close to deciding that the
Michael> Unicode names are definitive, which means that regular
Michael> expressions become invalid when Unicode decides to change
Michael> the names of the blocks...
I think that's reasonable.
And I don't think that block names should be used for anything other
than saying which block a character belongs to.
The blocks have no significance other than how Unicode allocates
codepoints, I think.
--
Colin Adams
Preston Lancashire
|