MIME-Version: 1.0
From: Royce Williams
Date: Sun, 29 Dec 2024 17:07:39 -0900
To: tuhs@tuhs.org
Subject: [TUHS] Re: "Webster's Second on the Head of a Pin"?
List-Id: The Unix Heritage Society mailing list
Precedence: list
X-MailFrom: royce@techsolvency.com
X-Mailman-Version: 3.3.6b1
Message-ID-Hash: IN3RWV2U7XGQOTKFBDERMH3JBXRSGX5O
Content-Type: multipart/alternative; boundary="0000000000001881d5062a7349e2"

--0000000000001881d5062a7349e2
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Sun, Dec 29, 2024 at 3:37 PM Warren Toomey <wkt@tuhs.org> wrote:

> On Sat, Dec 28, 2024 at 05:53:48PM -0900, Royce Williams wrote:
> >    Someone I know is seeking the original version of an internal Bell Labs
> >    memo from 1974 titled "Webster's Second on the Head of a Pin" by Morris
> >    and Thompson. The topic appears to be related to improving the speed of
> >    lookups or search. It's cited in a few papers as "Unpublished Technical
> >    Memo, Bell Laboratories, Murray Hill, NJ 1974." All I can find online
> >    is citations. Any leads appreciated!
>
> Doug McIlroy sent me a copy, it's now here:
>
> https://www.tuhs.org/Archive/Documentation/TechReports/Bell_Labs/PinheadWebster.pdf
>
> Thanks Doug!

And many thanks from me and my colleague as well, Doug!

For future searchers, what follows is selected (unique) front matter from the memo, rewrapped slightly for Mailman width.


Title - Webster's Second on the Head of a Pin
Date - July 15, 1974
TM - 74-1271-13

Other keywords - words, text compression

Author          Location     Extension
Robert Morris   MH 2C-524    3878
Ken Thompson    MH 2C-523    2394

Charging case - 39199
Filing Case - 39199-11


ABSTRACT

We used the list of words from Webster's Second Unabridged Dictionary
(without definitions) as a test case for special purpose text
compression techniques.

We compressed it by a factor of 4.52 to 1.

The 234,932 words originally occupied 2,486,781 bytes and were
compressed into 549,388 bytes. The size of the decoding program is
1356 bytes.

The initial characters of a word that agreed with the initial
characters of the previous word were dropped and replaced by a code.
Common suffixes were also coded. Finally, a variable-length code was
used.


Pages    Text 6    Other 0    Total 6
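
A note for readers unfamiliar with the first technique in the abstract: dropping the characters a word shares with its predecessor is commonly known as front coding. The Python sketch below illustrates only that one step on a sorted word list. It is not taken from the memo: the function names, the (shared length, remainder) pair representation, and the sample words are assumptions made for illustration, and the memo's suffix codes and variable-length code are not modeled.

```python
def front_encode(words):
    """Front-code a sorted word list as (shared_prefix_length, remainder) pairs.

    Hypothetical helper for illustration; not the memo's actual format.
    """
    encoded = []
    previous = ""
    for word in words:
        # Count how many leading characters this word shares with the previous one.
        shared = 0
        limit = min(len(previous), len(word))
        while shared < limit and previous[shared] == word[shared]:
            shared += 1
        # Keep only the count and the part of the word that differs.
        encoded.append((shared, word[shared:]))
        previous = word
    return encoded


def front_decode(encoded):
    """Rebuild the original word list from (shared_prefix_length, remainder) pairs."""
    words = []
    previous = ""
    for shared, remainder in encoded:
        word = previous[:shared] + remainder
        words.append(word)
        previous = word
    return words


# Assumed sample data, chosen only to show shared prefixes.
sample = ["pin", "pinch", "pinhead", "pinhole", "pint"]
pairs = front_encode(sample)
print(pairs)
print(front_decode(pairs) == sample)
```

On these five sample words the remainders total 13 characters against 26 in the original words, before counting whatever it costs to store the shared-length codes.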


-- 
Royce
--0000000000001881d5062a7349e2--