From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by yquem.inria.fr (Postfix) with ESMTP id F3FA1BB9C for ; Thu, 17 Nov 2005 10:25:11 +0100 (CET) Received: from pauillac.inria.fr (pauillac.inria.fr [128.93.11.35]) by concorde.inria.fr (8.13.0/8.13.0) with ESMTP id jAH9PBlG022182 for ; Thu, 17 Nov 2005 10:25:11 +0100 Received: from nez-perce.inria.fr (nez-perce.inria.fr [192.93.2.78]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id KAA12681 for ; Thu, 17 Nov 2005 10:25:10 +0100 (MET) Received: from einhorn.in-berlin.de (einhorn.in-berlin.de [192.109.42.8]) by nez-perce.inria.fr (8.13.0/8.13.0) with ESMTP id jAH9P96q010177 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL) for ; Thu, 17 Nov 2005 10:25:10 +0100 X-Envelope-From: oliver@first.in-berlin.de X-Envelope-To: Received: from first.in-berlin.de (e178009010.adsl.alicedsl.de [85.178.9.10]) (authenticated bits=0) by einhorn.in-berlin.de (8.12.10/8.12.10/Debian-4) with ESMTP id jAH9P82h009329 for ; Thu, 17 Nov 2005 10:25:09 +0100 Received: by first.in-berlin.de (Postfix, from userid 501) id 9DB77192F6E; Thu, 17 Nov 2005 10:24:30 +0100 (CET) Date: Thu, 17 Nov 2005 10:24:30 +0100 From: Oliver Bandel To: caml-list@inria.fr Subject: Re: [Caml-list] [1/2 OT] Indexing (and mergeable Index-algorithms) Message-ID: <20051117092430.GA521@first.in-berlin.de> References: <20051116234238.GA5741@first.in-berlin.de> <437C40EE.7040005@bik-gmbh.de> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <437C40EE.7040005@bik-gmbh.de> User-Agent: Mutt/1.5.6i X-Scanned-By: MIMEDefang_at_IN-Berlin_e.V. on 192.109.42.8 X-Miltered: at concorde with ID 437C4C77.000 by Joe's j-chkmail (http://j-chkmail.ensmp.fr)! X-Miltered: at nez-perce with ID 437C4C75.000 by Joe's j-chkmail (http://j-chkmail.ensmp.fr)! X-Spam: no; 0.00; oliver:01 bandel:01 oliver:01 in-berlin:01 caml-list:01 indexing:01 hars:01 bandel:01 citeseer:01 citeseer:01 indexing:01 inverted:01 psu:98 full-text:98 wrote:01 X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on yquem.inria.fr X-Spam-Level: X-Spam-Status: No, score=0.1 required=5.0 tests=FORGED_RCVD_HELO autolearn=disabled version=3.0.3 On Thu, Nov 17, 2005 at 09:35:58AM +0100, Florian Hars wrote: > Oliver Bandel wrote: > >It's mainly a question on datastructures/algorithms > > I tend to try to find answers to such questions on CiteSeer, maybe you > could start at the first paper I found with a quick search: > http://citeseer.ist.psu.edu/cutting90optimizations.html > and then look at the papers citing it, or similar to it. well, thats, where my further search directed me to. :) I found an interesting paper there, about using updatable indexing ("Fast Incremental Indexing for Full-Text Information Retrieval" from Brown/Callen/Croft.) They talked about "inverted lists", and this together with other hints from this list may be a good starter. > > (WARNING: Excessive use of CiteSeer may lead to addiction.) Yes, that's true. I was an addict and hope to get clean, but now google (and you too) directed me back... ;-) Ciao, Oliver