9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
From: "Roman V. Shaposhnik" <rvs@sun.com>
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
Subject: Re: [9fans] Plan 9 source history (was: Re: source browsing via	http	is back)
Date: Wed, 11 Feb 2009 11:06:15 -0800	[thread overview]
Message-ID: <1234379175.4957.320.camel@goose.sun.com> (raw)
In-Reply-To: <20090211014302.GP22259@masters6.cs.jhu.edu>

On Tue, 2009-02-10 at 20:43 -0500, Nathaniel W Filardo wrote:
> Incidentally, a git repository of the crawls, from 2002/1212 to 2009/0205,
> is available at http://mirrors.acm.jhu.edu/trees/plan9native/ .  Git gets
> the data down to 165M after a gc run, so perhaps it's a better idea than a
> venti-based mirror.

Where did 165M came from? The history itself seems to be only about 58M
or so:
  $ wget http://mirrors.acm.jhu.edu/trees/plan9native/.git/objects/pack/pack-afe021812ab52f698895941f8eb5ad4e3d75020e.pack
  $ ls -l pack-afe021812ab52f698895941f8eb5ad4e3d75020e.pack
  -rw-rw-r--   1 rs76089  staff    61039150 Feb 11 06:40 pack-afe021812ab52f698895941f8eb5ad4e3d75020e.pack

And, after the following simple minded manipulations:
  $ git init
  $ git unpack-objects < pack*
  $ git checkout -b master 68e58814202bccfbd7186962daedd754ae76d7df
  warning: You appear to be on a branch yet to be born.
  warning: Forcing checkout of 68e58814202bccfbd7186962daedd754ae76d7df.
  Checking out files: 100% (14229/14229), done.
  Already on "master"
  $ git repack -ad --window 100 --depth 100
  Counting objects: 39971, done.
  Compressing objects: 100% (39354/39354), done.
  Writing objects: 100% (39971/39971), done.
  Total 39971 (delta 25278), reused 0 (delta 0)

Made it even smaller (you can fine tune it even more, based on
usage requirements):
  $ ls -l .git/objects/pack/*.pack
  -r--r--r--   1 rs76089  staff    57694396 Feb 11 11:03 .git/objects/pack/pack-afe021812ab52f698895941f8eb5ad4e3d75020e.pack

>  I haven't managed to make my version of Uriel's port
> (thanks for the start! :) ) of git do the right thing in enough cases yet,
> so the git repo may not be updated for a while, but I figured somebody might
> want to play with it in the interim.

The coolest things, of course, would be to have a way of running git on
the bell labs end. But doing a replica and repacking everything locally
is not bad at all.

Thanks,
Roman.




  parent reply	other threads:[~2009-02-11 19:06 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-02-10 18:49 [9fans] source browsing via http is back geoff
2009-02-10 19:02 ` Bruce Ellis
2009-02-10 21:10 ` John Barham
2009-02-10 21:15   ` ron minnich
2009-02-10 21:22     ` Nathaniel W Filardo
2009-02-10 21:32       ` erik quanstrom
2009-02-10 21:51         ` Roman V. Shaposhnik
2009-02-10 21:55           ` erik quanstrom
2009-02-10 22:05             ` Roman V. Shaposhnik
2009-02-10 22:13           ` Nathaniel W Filardo
2009-02-10 22:17             ` Roman V. Shaposhnik
2009-02-10 22:08         ` Nathaniel W Filardo
2009-02-10 22:10           ` erik quanstrom
2009-02-10 22:23             ` Roman V. Shaposhnik
2009-02-10 22:28               ` erik quanstrom
2009-02-10 22:45                 ` Roman V. Shaposhnik
2009-02-11  0:22                   ` Bruce Ellis
2009-02-11  0:28                     ` Roman V. Shaposhnik
2009-02-11  6:06                       ` Bruce Ellis
2009-02-11  0:32                     ` Akshat Kumar
2009-02-11  1:43                   ` [9fans] Plan 9 source history (was: Re: source browsing via http is back) Nathaniel W Filardo
2009-02-11  3:40                     ` erik quanstrom
2009-02-11 18:07                     ` Uriel
2009-02-11 18:19                       ` Venkatesh Srinivas
2009-02-11 18:35                         ` Roman V. Shaposhnik
2009-02-11 18:46                           ` Nathaniel W Filardo
2009-02-12 15:10                       ` Venkatesh Srinivas
2009-02-11 19:06                     ` Roman V. Shaposhnik [this message]
2009-02-12  5:57                 ` [9fans] source browsing via http is back sqweek
2009-02-12 12:49                   ` erik quanstrom
2009-02-12 13:10                     ` Bruce Ellis
2009-02-12 16:19                     ` Roman V. Shaposhnik
2009-02-12 16:28                       ` erik quanstrom
2009-02-12 16:42                     ` Nathaniel W Filardo
2009-02-12 16:50                       ` andrey mirtchovski
2009-02-12 16:56                         ` Nathaniel W Filardo
2009-02-12 16:58                         ` erik quanstrom
2009-02-12 17:20                         ` Bruce Ellis
2009-02-12 16:52                       ` erik quanstrom
2009-02-10 22:27       ` Nathaniel W Filardo

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1234379175.4957.320.camel@goose.sun.com \
    --to=rvs@sun.com \
    --cc=9fans@9fans.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).