equivalence of and et. al. ?

Cart

XML Editor - Download a Free Trial >

See What's New >

Buy Now >

[Home] [By Thread] [By Date] [Recent Entries]

To: xml-dev@l...
Subject: equivalence of and et. al. ?
From: Stuart A Yeates <stuart.yeates@c...>
Date: Mon, 23 Feb 2004 20:22:14 +0000
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.6) Gecko/20040122 Debian/1.6-1

I have written a natural language modelling tool which marks up (inserts 
XML tags into) natural language documents already in XML.

I have come across an issue with this tool: some users and documents 
have an expectation that <i><b></b></i> and <b><i></i></b> (and similar 
classes of constructs) are equivalent, whereas my tool sees these are 
completely distinct.

 From looking at at the standards, is appears that HTML, XHTML and XML 
are all silent on the semantics of situations such as this.

Are there any systems or toolkits which have already been written to 
help systematise documents and corpora into a single, consistent 
representation?

cheers
stuart

-- 
Stuart Yeates            stuart.yeates@c...
OSS Watch                                  http://www.oss-watch.ac.uk/
Oxford Text Archive                             http://ota.ahds.ac.uk/
Humbul Humanities Hub                         http://www.humbul.ac.uk/

Prev by Date: Re: Piccolo Java SAX parser and others in the wild?
Next by Date: RE: equivalence of and et. al. ?
Previous by thread: RE: Standards for Search Systems
Next by thread: RE: equivalence of and et. al. ?
Index(es):
- Date
- Thread

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >