[Home] [By Thread] [By Date] [Recent Entries]

  • To: 'Sean McGrath' <sean.mcgrath@p...>, xml-dev@l...
  • Subject: RE: An alternative formulation of the document-centric/data-centric XML divide
  • From: "Bullard, Claude L (Len)" <len.bullard@i...>
  • Date: Thu, 3 Jun 2004 09:08:12 -0500

Right.  As the precision of a semantic definition increases, 
its frequency decreases.  A box is a box is a box is a box.
A part number is not.  Also related to Boltzman entropy.

This relates directly to Roger's thread and population 
and evolution.  The billion dollar question is:  so but 
so what?   Do paragraph tags evolve?  Can they?  What 
about semantic tags?

A <p> can become a member of almost any language. 
A <partno> can't.  What does that say about a
namespace partitioning of <p>?  Is it gratuitous? 
Were the arch forms guys right?

Did the <p> evolve/speciate into the <div>?

Thanks Sean.  That's useful.  If you want to 
have more fun, plot it against users and 
semantic implementation (the three dimensional 
view).  The results should be totally unsurprising.

len


From: Sean McGrath [mailto:sean.mcgrath@p...]

Document-centric XML:
	XML in which corpora conforming to schema X, exhibit power law 
distributions of the element types in X.

Data-centric XML:
	XML in which corpora conforming to schema X, exhibit uniform
distributions 
of the element types in X.

Not perfect but useful nonetheless I think. Mixed content is missing for a 
start.

Anyway, please take a look at the graphs at:
	
http://seanmcgrath.blogspot.com/2004_05_23_seanmcgrath_archive.html#10857620
2776583412 


I'd be very interested in seeing other peoples graphs of the tag-share of 
their XML corpora.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member