The Unix Heritage Society mailing list
 help / color / mirror / Atom feed
From: Douglas McIlroy <douglas.mcilroy@dartmouth.edu>
To: James Johnston <audioskeptic@gmail.com>
Cc: TUHS <tuhs@tuhs.org>
Subject: [TUHS] Re: "Webster's Second on the Head of a Pin"?
Date: Mon, 10 Mar 2025 20:23:30 -0400	[thread overview]
Message-ID: <CAKH6PiV2qQGvDd+PNDBYYeEfZ8pBTb8KVx2=YNNsVe4ma7-+cg@mail.gmail.com> (raw)
In-Reply-To: <CAO2qRdMiEtL0h8CQiMBv53XAKQp7vVrP7oBxdv4SBVrOv9B+xQ@mail.gmail.com>

Gnu lzip -9  got a 3.92 compression factor , while Morris-Thompson got 4.52.

According to the lzip man page, lzip has integrity checks, which must
take some space. It is also said to be a "simplified" L-Z, though the
only simplification that is mentioned--bounding the size of the
dictionary--can only happen on much bigger files than Webster. If you
still wonder how well "pure" L-Z does, I'll be happy to try your
favorite implementation.

 It did not surprise me that lzip lost to M-T, for it has to discover
over and over again that the dictionary is lexicographically
ordered--a fact that gives M-T most of its leverage. The dictionary is
definitely not generated by a Markov process!

Doug

On Mon, Mar 10, 2025 at 6:56 PM James Johnston <audioskeptic@gmail.com> wrote:
>
> Doug, do you know how much better Ziv-Lempel does? I'm guessing "a little bit"?
>
> On Mon, Mar 10, 2025 at 3:52 PM Douglas McIlroy <douglas.mcilroy@dartmouth.edu> wrote:
>>
>> I have sent a copy to Warren for the Unix archives.
>>
>> Doug
>>
>> On Sat, Dec 28, 2024 at 9:54 PM Royce Williams <royce@techsolvency.com> wrote:
>> >
>> > Someone I know is seeking the original version of an internal Bell Labs memo from 1974 titled "Webster's Second on the Head of a Pin" by Morris and Thompson. The topic appears to be related to improving the speed of lookups or search. It's cited in a few papers as "Unpublished Technical Memo, Bell Laboratories, Murray Hill, NJ 1974." All I can find online is citations. Any leads appreciated!
>> >
>> > --
>> > Royce
>
>
>
> --
> James D. (jj) Johnston
>
> Former Chief Scientist, Immersion Networks

  reply	other threads:[~2025-03-11  0:23 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-12-29  2:53 [TUHS] " Royce Williams
2024-12-29 13:44 ` [TUHS] " Douglas McIlroy
2024-12-30 21:13   ` sjenkin
2024-12-31 16:37   ` Chet Ramey via TUHS
2025-01-01 15:02     ` Douglas McIlroy
2025-01-01 18:11       ` Rik Farrow
2025-01-02  3:05         ` Douglas McIlroy
2025-01-02 14:28           ` Chet Ramey via TUHS
2025-01-02 14:22         ` Chet Ramey via TUHS
2025-01-02 18:13           ` Rik Farrow
2025-01-02 19:47             ` Chet Ramey via TUHS
2024-12-29 16:05 ` Douglas McIlroy
2025-03-10 22:55   ` James Johnston
2025-03-11  0:23     ` Douglas McIlroy [this message]
2025-03-11 14:47       ` Jeff Johnson
2025-03-12 12:41         ` Douglas McIlroy
2024-12-30  0:37 ` Warren Toomey via TUHS
2024-12-30  2:07   ` Royce Williams

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAKH6PiV2qQGvDd+PNDBYYeEfZ8pBTb8KVx2=YNNsVe4ma7-+cg@mail.gmail.com' \
    --to=douglas.mcilroy@dartmouth.edu \
    --cc=audioskeptic@gmail.com \
    --cc=tuhs@tuhs.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).