From mboxrd@z Thu Jan 1 00:00:00 1970 Message-Id: <200201300547.PAA17510@hadrian.staff.apnic.net> From: ggm@apnic.net (George Michaelson) To: 9fans@cse.psu.edu Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Subject: [9fans] venti Date: Wed, 30 Jan 2002 15:35:00 -0500 Topicbox-Message-UUID: 48f60bec-eaca-11e9-9e20-41e7f4b1d025 Say you co-elesce the same blockset down to one common instance for a large number of files, and then lose that block because a fly shits on the write-once media. Didn't you just lose that bitpattern in time, across your entire archival filestore? Oooh. me no like. I suppose that is exactly what happens to 99% of dataloss anyway, if you believe its mostly unique data. This is precicely why holotype fossils are not on display: they are too valuable to expose to risk of catastrophic loss. Protection of an entire venti filestore would thus mean either 2x the data size to replicate, or some sub-fraction to ECC check it, for some chosen level of protection. I suppose thats still better than having 300 repeat instances of the BSD copyright on every manpage. I notice in the paper the authors say "doable, but we didn't bother" which means until somebody in the operations space does budget-up for the offsite archive, and some checks to make sure the copy is 1:1 acceptable, the venti filestore is a bit more risky than you might want. Even backed on RAID5, it can lose bigtime from some failures. Might be nice for @home fs, on IDE raid mind you! This also looks to have resonances with rsync, in a data persistant state instead of for update of two copies of the same resource. The rsync papers discuss finding good hash algs and good blocksizes to make this work. No citation, perhaps if you followed the rsync citation index back they'd wind up in the same prime sources. I guess the venti authors knew about rsync but don't see it as having relevance. Do you get nice COW form savings in memory/load time because every pointer can walk through the same memory reference to find the string? That could be really nice! And, it would make smaller footprints on tiny devices as well. cheers -George