I've spent more time than I care to admit loading, dumping,reloading, tranforming, testing, reloading...various wikipediadatabases before settling on what I think the new format willbe, but I made a discovery along the way that might be useful:The 05/20 database dump from wikipedia weighs in at close to600 MB. It turns out that almost 200 MB of that is cache. Inthe new system, I'll write a function specifically for doingdatabase dumps, but in the meantime I'd suggest that the nexttime you dump a tarball, clear the cache first (and don'tforget to be careful of the timestamps when you do).0