[Home] [By Thread] [By Date] [Recent Entries]


On Thu, 5 May 2005 1:23 am, Bullard, Claude L (Len) wrote:
> Yes, VSM is a form of document indexing and classification.
> It uses term frequency to create similarity metrics, typically
> a cosine for the angle between terms which normalizes the
> distance.  There are a LOT of papers you can read freely
> available simply by entering "vector space model" into
> that ever-loving simple box that does such a good job for
> cases where SQL falls on its bum.  No structure == no SQL.
> ....

oh ok. I'm clear on that now. I actually worked on
such technologies in the '90s for a while before all the
buzzwords got changed. It's interesting stuff.

> Again, the problem of XML is apriori classification.

Of course, xml needs a good classification system to
be truly effective in a business context.

but I guess as Ken says, it's all wrapped into the
operating system these days. (Pity though it is an
undocumented feature)

David


-- 
Computergrid : The ones with the most connections win.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member