
  • From: Eric Lemond <elemond@n...>
  • To: Lorne Harwood <lorneharwood@h...>
  • Date: Thu, 01 Nov 2001 14:12:13 -0700

Lorne,
The load speed we see is generally between 50K and 250K per second per
thread.  The person who loaded GenBank is on vacation, so I don't have
an exact load time. The exact load rate will vary based on the size of
the docs, memory and system configuration.  Keep in mind that the rate
includes the equivalent of full indexing.  Wish I could give you
specifics, but I hope this is helpful.
Eric Lemond
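
For a rough sense of what that rate range implies for the 44.1 GB data set
mentioned in the quoted message, here is a back-of-the-envelope sketch.  It
rests on assumptions not stated above: that "K" means kilobytes (decimal),
that the rate is sustained for the whole load, and that throughput scales
linearly with the number of threads.  The function name and the thread
parameter are illustrative only, and the result is not a measured load time.

```python
def load_time_hours(total_bytes, rate_bytes_per_sec, threads=1):
    """Estimate load time in hours, assuming a constant per-thread rate
    and linear scaling across threads (both are assumptions)."""
    return total_bytes / (rate_bytes_per_sec * threads) / 3600

KB = 10**3   # assumption: "K" in the message means decimal kilobytes
GB = 10**9   # assumption: decimal gigabytes

total = 44.1 * GB                        # data set size from the quoted message
slow = load_time_hours(total, 50 * KB)   # low end of the stated range
fast = load_time_hours(total, 250 * KB)  # high end of the stated range

print(f"single thread: {fast:.0f} to {slow:.0f} hours")
```

Under those assumptions a single thread would need roughly 49 to 245 hours;
passing a larger threads value shows how the estimate shrinks if the load is
parallelized.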

-----Original Message-----
From: Lorne Harwood [mailto:lorneharwood@h...] 
Sent: Thursday, November 01, 2001 1:10 PM
To: Eric Lemond
Cc: xml-dev@l...
Subject: RE:  XML Database Decision Tree?

Eric,

How long did it take to load/index the 44.1 GB?

Lorne


I can give a real example of loading data into NeoCore XMS.  This was a
project one of our engineers did to prove that we could handle huge data
sets.  It's not meant as a benchmark, but to illustrate data loading
simplicity.

We got a copy of the 44.1 GB GenBank of genomics research.  We converted
the documents to XML with a small Perl script.  The documents average
200 MB each.

We loaded them using the command:

    neoxmlutils import [config dir location] [import dir]

The resulting database footprint was 34.4 GB (under 80% of the size of the
original data).  You don't have to create any indexes.  With our pattern
processing technology, the database is fully indexed.
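
The footprint figure can be checked with simple arithmetic.  This sketch uses
only the two sizes quoted above; the variable names are illustrative.

```python
original_gb = 44.1   # size of the source data, from the message
database_gb = 34.4   # resulting database footprint, from the message

ratio = database_gb / original_gb
print(f"footprint is {ratio:.1%} of the original data")
```

That works out to about 78%, consistent with the "under 80%" claim.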

Just wanted to confirm how easy it is to load data into an XML Database.

Eric Lemond

