From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on yquem.inria.fr X-Spam-Level: X-Spam-Status: No, score=0.0 required=5.0 tests=AWL autolearn=disabled version=3.1.3 X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by yquem.inria.fr (Postfix) with ESMTP id D2E38BC6B for ; Sat, 28 Apr 2007 12:58:20 +0200 (CEST) Received: from einhorn.in-berlin.de (einhorn.in-berlin.de [192.109.42.8]) by concorde.inria.fr (8.13.6/8.13.6) with ESMTP id l3SAwKNc009570 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL) for ; Sat, 28 Apr 2007 12:58:20 +0200 X-Envelope-From: oliver@first.in-berlin.de X-Envelope-To: Received: from first (dslb-088-073-111-021.pools.arcor-ip.net [88.73.111.21]) (authenticated bits=0) by einhorn.in-berlin.de (8.13.6/8.13.6/Debian-1) with ESMTP id l3SAwJwL006979 for ; Sat, 28 Apr 2007 12:58:19 +0200 Received: by first (Postfix, from userid 501) id 959603AE3F6; Sat, 28 Apr 2007 12:58:16 +0200 (CEST) Date: Sat, 28 Apr 2007 12:58:16 +0200 From: Oliver Bandel To: caml-list@inria.fr Subject: Re: [Caml-list] mboxlib reloaded ;-) Message-ID: <20070428105815.GB363@first.in-berlin.de> References: <20070427135425.GA1161@first.in-berlin.de> <20070427162911.GA10099@furbychan.cocan.org> <20070427231220.GA1507@first.in-berlin.de> <20070428075627.GA21610@furbychan.cocan.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20070428075627.GA21610@furbychan.cocan.org> User-Agent: Mutt/1.5.6i X-Scanned-By: MIMEDefang_at_IN-Berlin_e.V. on 192.109.42.8 X-Miltered: at concorde with ID 463328CC.000 by Joe's j-chkmail (http://j-chkmail . ensmp . fr)! X-Spam: no; 0.00; bandel:01 in-berlin:01 reloaded:01 0100,:01 0200,:01 bandel:01 0100,:01 threading:01 threading:01 28,:98 28,:98 disease:98 rot:98 reloads:98 trie:98 On Sat, Apr 28, 2007 at 08:56:28AM +0100, Richard Jones wrote: > On Sat, Apr 28, 2007 at 01:12:20AM +0200, Oliver Bandel wrote: > > On Fri, Apr 27, 2007 at 05:29:11PM +0100, Richard Jones wrote: > > > The source for COCANWIKI[1] contains extensive support for threading > > > of mail messages, based on JWZ's algorithm: > > > > > > http://www.jwz.org/doc/threading.html > > > > Nice... you speak of an optimized algorithm for threading. > > I didn't explored your solution nor did I explored your > > paper in detail (tomorrow I think I have the time to do it), > > I should point out that the algorithm is due to esteemed hacker[1] > Jamie Zawinski (http://jwz.org) who used it in Netscape versions 1 > through 3. They got C++/OO group-think disease from Netscape 4 and > above (the rot continues to this day). When I think about netscape, I think about the terrible way, how it handles newsgroups.... ...when doing a resort of the newsgroup-entries it's extremely slow. Maybe it again checks data on the newsserver again and again. And when I think about Netscape, I se that it reloads data via network, when you want to save it, even if it' already on the screen. So, theese things are the reason, why I don't see why this algorithm is something special, until I have seen that it really is fast. ;-) I have thought a while about the threading, and thought I have some good ideas. A while later, I saw, there already is a datastructure that is useful for threading, and it is the Tries-datastructure: http://www.csse.monash.edu.au/~lloyd/tildeAlgDS/Tree/Trie/ If you use the message ID's instead of the char's of a word, then this datastructure is well suited for news-/mail-threading, I think. So, other people already had done, what can be used here. But maybe thze algorithm you show on your pages is really good. But I would trust your algorithms more than algorithms of netscape-programmers ;-) But maybe it's really good and only the overall-design of Netscape was bad. ;-) Ciao, Oliver