9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
* [9fans] fossil/venti/manber
@ 2003-04-28 20:17 FODEMESI Gergely
  2003-04-28 21:55 ` Russ Cox
  0 siblings, 1 reply; 3+ messages in thread
From: FODEMESI Gergely @ 2003-04-28 20:17 UTC (permalink / raw)
  To: plan9 mailing list

Hi,

 in the venti paper there is a reference to manber's algorithm, for
possible future development to venti/fossil. (Udi Manber: Finding similar
files in a large file system)

Did anyone consider giving this possible development a second thought?

I'd like to elaborate on the possibility of using this algorithm with
venti.
Could somebody correct me if the following comments are false?

1. Anchors would be needed to synchronize to block boundries.

2. In order to somehow detect possible similar bit-streams, venti must
know more about the meta information on these similar bit-streams
(files/directories). By this I mean venti format has to be extended with
meta information on files.

3. Venti would have to implement a method of generating possible anchors
to possible similar bit-streams to "new" (i.e. freshly stored)
bit-streams. This should probably be done parallel to storing new blocks.
"Lazy anchoring?"

4. Except for databases with dynamically changing sizes (are there any?),
what kind of bit-streams could such a method be used for?

5. Depending on the comments to 4. could anybody imagine changing venti
format in order to provide such a seemingly marginally useful feature? See
Russ's comment on possibly never changing the venti format.

 thanks for listening: gergo


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2003-04-28 22:13 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-04-28 20:17 [9fans] fossil/venti/manber FODEMESI Gergely
2003-04-28 21:55 ` Russ Cox
2003-04-28 22:13   ` William Josephson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).