hashing

Cart

XML Editor - Download a Free Trial >

See What's New >

Buy Now >

[Home] [By Thread] [By Date] [Recent Entries]

To: xml-dev@l...
Subject: hashing
From: Eric Hanson <eric@a...>
Date: Thu, 29 Apr 2004 19:58:17 +0000
User-agent: Mutt/1.2i

I have a large collection of XML documents, and want to find and
group any duplicates.  The obvious but slow way of doing this is
to just compare them all to each other.  Is there a better
approach?

Particularly, is there any APIs or standards for "hashing" a
document so that duplicates could be identified in a similar way
to what you'd do with a hash table?

Thanks,
Eric

Follow-Ups:
- Re: hashing
  - From: "Jeff Greif" <jgreif@a...>
- Re: hashing
  - From: David Megginson <dmeggin@a...>

Prev by Date: RE: ISO and the Standards Golden Hammer (was Re: You call that a standard?)
Next by Date: WAY OFFTOPIC: ( RE: ISO and the Standards Golden Hammer (was Re: [xml-d ev] You call that a standard?))
Previous by thread: RE: ISO and the Standards Golden Hammer (was Re: [xml-d ev] You call that a standard?)
Next by thread: Re: hashing
Index(es):
- Date
- Thread

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >