From: Royce Williams <royce@techsolvency.com>
To: tuhs@tuhs.org
Subject: [TUHS] Re: "Webster's Second on the Head of a Pin"?
Date: Sun, 29 Dec 2024 17:07:39 -0900 [thread overview]
Message-ID: <CA+E3k90PXbXt1tAGwovtnAWv6TK6gT9=ot6tW97jzKW3wA_YRg@mail.gmail.com> (raw)
In-Reply-To: <Z3HrQ24VdAiT9cxz@minnie.tuhs.org>
[-- Attachment #1: Type: text/plain, Size: 1859 bytes --]
On Sun, Dec 29, 2024 at 3:37 PM Warren Toomey <wkt@tuhs.org> wrote:
> On Sat, Dec 28, 2024 at 05:53:48PM -0900, Royce Williams wrote:
> > Someone I know is seeking the original version of an internal Bell
> Labs
> > memo from 1974 titled "Webster's Second on the Head of a Pin" by
> Morris
> > and Thompson. The topic appears to be related to improving the speed
> of
> > lookups or search. It's cited in a few papers as "Unpublished
> Technical
> > Memo, Bell Laboratories, Murray Hill, NJ 1974." All I can find online
> > is citations. Any leads appreciated!
>
> Doug McIlroy sent me a copy, it's now here:
>
>
> https://www.tuhs.org/Archive/Documentation/TechReports/Bell_Labs/PinheadWebster.pdf
>
> Thanks Doug!
>
And many thanks from me and my colleague as well, Doug!
For future searchers, what follows is selected (unique) front matter from
the memo, rewrapped slightly for Mailman width.
Title - Webster's Second on the Head of a Pin
Date - July 15, 1974
TM - 74-1271-13
Other keywords - words, text compression
Author Location Extension
Robert Morris MH 2C-524 3878
Ken Thompson MH 2C-523 2394
Charging case - 39199
Filing Case - 39199-11
ABSTRACT
We used the list of words from Webster's Second Unabridged Dictionary
(without definitions) as a test case for special purpose text
compression techniques.
We compressed it by a factor of 4.52 to 1.
The 234,932 words originally occupied 2,486,781 bytes and were
compressed into 549,388 bytes. The size of the decoding program is
1356 bytes.
The initial characters of a word that agreed with the initial
characters of the previous word were dropped and replaced by a code.
Common suffixes were also coded. Finally, a variable-length code was
used.
Pages Text 6 Other 0 Total 6
--
Royce
[-- Attachment #2: Type: text/html, Size: 2641 bytes --]
prev parent reply other threads:[~2024-12-30 2:08 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-12-29 2:53 [TUHS] " Royce Williams
2024-12-29 13:44 ` [TUHS] " Douglas McIlroy
2024-12-30 21:13 ` sjenkin
2024-12-31 16:37 ` Chet Ramey via TUHS
2025-01-01 15:02 ` Douglas McIlroy
2025-01-01 18:11 ` Rik Farrow
2025-01-02 3:05 ` Douglas McIlroy
2025-01-02 14:28 ` Chet Ramey via TUHS
2025-01-02 14:22 ` Chet Ramey via TUHS
2025-01-02 18:13 ` Rik Farrow
2025-01-02 19:47 ` Chet Ramey via TUHS
2024-12-29 16:05 ` Douglas McIlroy
2025-03-10 22:55 ` James Johnston
2025-03-11 0:23 ` Douglas McIlroy
2025-03-11 14:47 ` Jeff Johnson
2025-03-12 12:41 ` Douglas McIlroy
2024-12-30 0:37 ` Warren Toomey via TUHS
2024-12-30 2:07 ` Royce Williams [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CA+E3k90PXbXt1tAGwovtnAWv6TK6gT9=ot6tW97jzKW3wA_YRg@mail.gmail.com' \
--to=royce@techsolvency.com \
--cc=tuhs@tuhs.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).