From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by yquem.inria.fr (Postfix) with ESMTP id D678DBB9C for ; Thu, 17 Nov 2005 21:58:05 +0100 (CET) Received: from pauillac.inria.fr (pauillac.inria.fr [128.93.11.35]) by concorde.inria.fr (8.13.0/8.13.0) with ESMTP id jAHKw5H5006746 for ; Thu, 17 Nov 2005 21:58:05 +0100 Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id VAA27949 for ; Thu, 17 Nov 2005 21:58:05 +0100 (MET) Received: from einhorn.in-berlin.de (einhorn.in-berlin.de [192.109.42.8]) by concorde.inria.fr (8.13.0/8.13.0) with ESMTP id jAHKw456006742 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL) for ; Thu, 17 Nov 2005 21:58:04 +0100 X-Envelope-From: oliver@first.in-berlin.de X-Envelope-To: Received: from first.in-berlin.de (e178009010.adsl.alicedsl.de [85.178.9.10]) (authenticated bits=0) by einhorn.in-berlin.de (8.12.10/8.12.10/Debian-4) with ESMTP id jAHKw0t4018242 for ; Thu, 17 Nov 2005 21:58:02 +0100 Received: by first.in-berlin.de (Postfix, from userid 501) id AA98E193C7D; Thu, 17 Nov 2005 21:57:22 +0100 (CET) Date: Thu, 17 Nov 2005 21:57:22 +0100 From: Oliver Bandel To: caml-list@inria.fr Subject: Re: [Caml-list] [1/2 OT] Indexing (and mergeable Index-algorithms) Message-ID: <20051117205722.GA492@first.in-berlin.de> References: <20051116234238.GA5741@first.in-berlin.de> <437C40EE.7040005@bik-gmbh.de> <20051117092430.GA521@first.in-berlin.de> <87u0ebih2o.fsf@mid.deneb.enyo.de> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <87u0ebih2o.fsf@mid.deneb.enyo.de> User-Agent: Mutt/1.5.6i X-Scanned-By: MIMEDefang_at_IN-Berlin_e.V. on 192.109.42.8 X-Miltered: at concorde with ID 437CEEDD.000 by Joe's j-chkmail (http://j-chkmail.ensmp.fr)! X-Miltered: at concorde with ID 437CEEDC.001 by Joe's j-chkmail (http://j-chkmail.ensmp.fr)! X-Spam: no; 0.00; oliver:01 bandel:01 oliver:01 in-berlin:01 caml-list:01 indexing:01 bandel:01 indexing:01 inverted:01 reuse:01 iirc:01 full-text:98 full-text:98 wrote:01 incremental:01 X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on yquem.inria.fr X-Spam-Level: X-Spam-Status: No, score=0.1 required=5.0 tests=FORGED_RCVD_HELO autolearn=disabled version=3.0.3 On Thu, Nov 17, 2005 at 01:39:43PM +0100, Florian Weimer wrote: > * Oliver Bandel: > > > I found an interesting paper there, about using updatable indexing > > ("Fast Incremental Indexing for Full-Text Information Retrieval" > > from Brown/Callen/Croft.) They talked about "inverted lists", and > > this together with other hints from this list may be a good starter. > > If you need a full-text search capability on natural language > documents, there are various libraries you could reuse: Xapian, Lucene > (although this is one is writtein Java, IIRC), Estraier, OpenFTS, a > MySQL component, and probably many more. constraint: have to use PostgreSQL. Does it have fulltext search? Ciao, Oliver