Date: Sat, 30 Nov 2019 17:25:22 -0800
From: "Greg A. Woods"
To: The Unix Heritage Society mailing list
In-Reply-To: <20191129215258.Vgu-C%steffen@sdaoden.eu>
References: <20191129215258.Vgu-C%steffen@sdaoden.eu>
User-Agent: Wanderlust/2.15.9 (Almost Unreal) SEMI-EPG/1.14.7 (Harue)
 FLIM/1.14.9 (=?ISO-8859-4?Q?Goj=F2?=) APEL/10.8 EasyPG/1.0.0
 Emacs/25.3 (x86_64--netbsd) MULE/6.0 (HANACHIRUSATO)
Reply-To: The Unix Heritage Society mailing list
Sender: "TUHS"

At Fri, 29 Nov 2019 22:52:58 +0100, Steffen Nurpmeso wrote:
Subject: Re: [TUHS] another conversion of the CSRG BSD SCCS archives to Git
>
> Greg A. Woods wrote in :
> |I've been fixing and enhancing James Youngman's git-sccsimport to use
> |with some of my SCCS archives, and I thought it might be the ultimate
> |stress test of it to convert the CSRG BSD SCCS archives.
> |
> |The conversion takes about an hour to run on my old-ish Dell server.
> |
> |This conversion is unlike others -- there is some mechanical compression
> |of related deltas into a single Git commit.
> |
> |https://github.com/robohack/ucb-csrg-bsd
>
> Thanks for taking the time to produce a CSRG repo that seems to
> mimic changesets as they really happened. As i never made it
> there on my own, i have switched to yours some weeks ago. (Mind
> you, after doing "gc --aggressive --prune=all" the repository size
> has more than halved, it was the final reason to prepare new
> repositories on a vhost with good internet connection before
> getting this through my flaky wifi here. Storage and internet
> bandwidth and their cost really do not seem to bother anyone
> anymore. I have no offense in mind, i only recognized it (the
> hard way).)

Ah! I did indeed forget the "git gc" step that many conversion guides
recommend.
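
For reference, the repacking step quoted above could be scripted roughly
as follows. This is only a sketch and not part of git-sccsimport: the
helper names (dir_size, gc_command, ref_snapshot, repack) are made up
for illustration, and it assumes git is on the PATH. It runs
"git gc --aggressive --prune=all", reports the size of the git directory
before and after, and checks that no ref moved, since gc only repacks
objects and does not change their IDs.

```python
import os
import subprocess
import sys


def dir_size(path):
    """Return the total size in bytes of all regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full = os.path.join(root, name)
            if os.path.isfile(full):
                total += os.path.getsize(full)
    return total


def gc_command(aggressive=True):
    """Build the git gc command line (hypothetical helper)."""
    cmd = ["git", "gc"]
    if aggressive:
        cmd.append("--aggressive")
    cmd.append("--prune=all")
    return cmd


def ref_snapshot(repo):
    """Return a text listing of every ref and the object it points at."""
    result = subprocess.run(
        ["git", "for-each-ref", "--format=%(objectname) %(refname)"],
        cwd=repo, check=True, capture_output=True, text=True,
    )
    return result.stdout


def repack(repo):
    """Run git gc in repo and return (bytes_before, bytes_after)."""
    result = subprocess.run(
        ["git", "rev-parse", "--git-dir"],
        cwd=repo, check=True, capture_output=True, text=True,
    )
    # --git-dir may be relative to repo; join() leaves absolute paths alone.
    git_dir = os.path.join(repo, result.stdout.strip())

    refs_before = ref_snapshot(repo)
    size_before = dir_size(git_dir)
    subprocess.run(gc_command(), cwd=repo, check=True)
    if ref_snapshot(repo) != refs_before:
        raise RuntimeError("refs changed during gc")
    return size_before, dir_size(git_dir)


if __name__ == "__main__":
    before, after = repack(sys.argv[1] if len(sys.argv) > 1 else ".")
    print(f"before: {before} bytes, after: {after} bytes")
```

Run it with the path to the converted repository as its only argument;
with no argument it uses the current directory.
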
I might change the import script to do that automatically, particularly
if it has also initialised the repository in the same run.

Apparently github themselves run it regularly:

	https://stackoverflow.com/a/56020315/816536

Probably they do this by configuring "gc.auto" in each repository,
though I've not found any reference to what they might configure it to.

However it seems that without the "--aggressive" option, nothing will be
done in this repository. With it though I go from 316M down to just
71M.

I don't see any way to force/tell/ask github to run "git gc --aggressive".

Perhaps I can just delete it from github and immediately re-create it
with the re-packed repository, and in theory all the hashes should stay
the same and any existing clones should be unaffected. What do you
think?

Note I have some thoughts of re-doing the whole conversion anyway, with
more ideas on how to deal with "removed" files (SCCS files renamed to
the likes of "S.foo") and also including the many files that were never
checked into SCCS, perhaps even on a per-release basis, thus being able
to create release tags that can be checked out to match the actual
releases on the CDs. But this will not happen quite so soon.

-- 
Greg A. Woods

Kelowna, BC     +1 250 762-7675     RoboHack
Planix, Inc.    Avoncote Farms