9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
From: Nathaniel W Filardo <nwf@cs.jhu.edu>
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
Subject: Re: [9fans] source browsing via http is back
Date: Thu, 12 Feb 2009 11:42:47 -0500	[thread overview]
Message-ID: <20090212164247.GV22259@masters6.cs.jhu.edu> (raw)
In-Reply-To: <a1d5e8b5186999a3a4660d7730f250f0@quanstro.net>

[-- Attachment #1: Type: text/plain, Size: 2693 bytes --]

On Thu, Feb 12, 2009 at 07:49:58AM -0500, erik quanstrom wrote:
> exactly.  the point i was trying to make, and evidently
> was being too coy about, is that 330 odd gb wouldn't
> be as useful a number as the sum of the sizes of all the
> new/changed files from all the dump days.  this would
> be a useful comparison because this would give a
> measure of how much space is saved with venti over
> the straightforward algorithm of copying the changed
> blocks, as ken's fileserver does.

Unless I misunderstand how replica works, the 330 odd GB number [1] is
useful as the amount of data that would have to be transfered over the wire
to initialize a mirror.  (Since, as I understand it, a replica log of
sourcesdump would have nothing but "add" commands for each $year/$dump/$file
entry, and would therefore necessitate transfering each file separately).

On the other hand, it's entirely possible that I'm missing some feature of
replica, or that some set of wrapper scripts around it would suffice.  If
so, please excuse, and correct, my ignorance.

On the first hand again, given the occasional reports of "replica hosed me"
I'm not terribly keen on trusting it and seem to recall that some of the
fixes have involved hand-editing the replica logs on sources.  This makes me
suspicious that some of the replica logs frozen in sourcesdump would be
incorrect and lead to incorrect data on mirrors if used as part of the
scheme.

With a venti & vac (auth/none vac, naturally, so as to not violate
filesystem permissions) based mirror, there's a single score published daily
that covers the entirety of sourcesdump so far, and a venti/copy -f
sufficies to bring any mirror up to date using at most 550 odd MB if the
initial mirror is empty. [2]

--nwf;

[1] The discrepency between 550 MB and 330 GB increases as time goes on and
as the slice of sources being mirrored goes from "just some source files
that some schmo thought would be nice to mirror" to "all of it".

[2] Further, 9fs access to sources is grand, but it does take me 10 to 15
minutes to pull down a day's worth of "just some source files", even if
nothing has changed and I uses vac -f, due to all the network latency for
Tstat/Rstat requests.  This could be improved in a number of ways, but it
strikes me as simpler to use venti/copy to copy only the incremental deltas.
Some brief experiments, transfering blocks from Baltimore back to a machine
in the same neighborhood as sources, indicate that venti/copy -f takes 15
minutes for the first copy (2002/1212) and that subsequently copying even a
dump with many changes (2008/0901) took only four.  (Git may do even
better.)

[-- Attachment #2: Type: application/pgp-signature, Size: 204 bytes --]

  parent reply	other threads:[~2009-02-12 16:42 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-02-10 18:49 geoff
2009-02-10 19:02 ` Bruce Ellis
2009-02-10 21:10 ` John Barham
2009-02-10 21:15   ` ron minnich
2009-02-10 21:22     ` Nathaniel W Filardo
2009-02-10 21:32       ` erik quanstrom
2009-02-10 21:51         ` Roman V. Shaposhnik
2009-02-10 21:55           ` erik quanstrom
2009-02-10 22:05             ` Roman V. Shaposhnik
2009-02-10 22:13           ` Nathaniel W Filardo
2009-02-10 22:17             ` Roman V. Shaposhnik
2009-02-10 22:08         ` Nathaniel W Filardo
2009-02-10 22:10           ` erik quanstrom
2009-02-10 22:23             ` Roman V. Shaposhnik
2009-02-10 22:28               ` erik quanstrom
2009-02-10 22:45                 ` Roman V. Shaposhnik
2009-02-11  0:22                   ` Bruce Ellis
2009-02-11  0:28                     ` Roman V. Shaposhnik
2009-02-11  6:06                       ` Bruce Ellis
2009-02-11  0:32                     ` Akshat Kumar
2009-02-11  1:43                   ` [9fans] Plan 9 source history (was: Re: source browsing via http is back) Nathaniel W Filardo
2009-02-11  3:40                     ` erik quanstrom
2009-02-11 18:07                     ` Uriel
2009-02-11 18:19                       ` Venkatesh Srinivas
2009-02-11 18:35                         ` Roman V. Shaposhnik
2009-02-11 18:46                           ` Nathaniel W Filardo
2009-02-12 15:10                       ` Venkatesh Srinivas
2009-02-11 19:06                     ` Roman V. Shaposhnik
2009-02-12  5:57                 ` [9fans] source browsing via http is back sqweek
2009-02-12 12:49                   ` erik quanstrom
2009-02-12 13:10                     ` Bruce Ellis
2009-02-12 16:19                     ` Roman V. Shaposhnik
2009-02-12 16:28                       ` erik quanstrom
2009-02-12 16:42                     ` Nathaniel W Filardo [this message]
2009-02-12 16:50                       ` andrey mirtchovski
2009-02-12 16:56                         ` Nathaniel W Filardo
2009-02-12 16:58                         ` erik quanstrom
2009-02-12 17:20                         ` Bruce Ellis
2009-02-12 16:52                       ` erik quanstrom
2009-02-10 22:27       ` Nathaniel W Filardo

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090212164247.GV22259@masters6.cs.jhu.edu \
    --to=nwf@cs.jhu.edu \
    --cc=9fans@9fans.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).