From: Harry Putnam
Newsgroups: gmane.emacs.gnus.general
Subject: nnir/freeWAIS-sf
Date: 15 Jul 2000 06:53:03 -0700
Original-To: ding@gnus.org
User-Agent: Gnus/5.0807 (Gnus v5.8.7) Emacs/20.5

FreeWAIS seems to be a difficult beast to deal with... I finally got
version 2.2.13 installed successfully on a FreeBSD OS.  Still having
poor luck getting freeWAIS to co-operate with nnir.

In brief, the problem areas I encounter are:

1) The C-u G G (allow group selection) option does not work with
   freeWAIS
2) Queries aimed at `from' or `to' fields fail, although queries to
   `subject' field or global queries, work

First a few facts about the basis of this report:

1) I've created a ~/Mail directory containing only two sub
   directories.  My collections of messages to ding list and bbdb list
   So
    ~/Mail/
       ding
       bbdb

2) Using the example *.fmt file from nnir-1.57.el and giving it the
   title mail.fmt:

    # Kai's format file for freeWAIS-sf for indexing mails.
    # Each mail is in a file, much like the MH format.

    # Document separator should never match -- each file is a document.
    record-sep: /^@this regex should never match@$/

    # Searchable fields specification.
    region: /^[sS]ubject:/ /^[sS]ubject: */
            subject "Subject header" stemming TEXT BOTH
    end: /^[^ \t]/

    region: /^([tT][oO]|[cC][cC]):/ /^([tT][oO]|[cC][cC]): */
            to "To and Cc headers" SOUNDEX BOTH
    end: /^[^ \t]/

    region: /^[fF][rR][oO][mM]:/ /^[fF][rR][oO][mM]: */
            from "From header" SOUNDEX BOTH
    end: /^[^ \t]/

    region: /^$/
            stemming TEXT GLOBAL
    end: /^@this regex should never match@$/

3) Wais command line used to create the index:

    waisindex -r -d mail -stem -t fields ~/Mail

   (also tried without -stem, getting the same results:

    waisindex -r -d mail -t fields ~/Mail )

4) These settings in .gnus:

    (load "nnir-1.57.elc")
    (setq nnir-wais-database "/home/reader/.wais/mail")
    (setq nnir-search-engine 'wais)

In gnus Group buffer pressing `C-u G G':

Query: nnir
(No prompt to select group spec appears during any of what follows)
Gives almost instantaneous results of two messages from ding list

Query: from=Kai
Gives: Couldn't request group: Search produced empty results
(but we all know better..:
    grep -r '^From:.*Kai' Mail/ding2|wc -l
    150)

I read somewhere that freeWAIS has trouble with words containing both
upper and lower case so:

Query: from=rossjohann
Gives: Couldn't request group: Search produced empty results
(But again we know better:
    grep -r 'From:.*rossjohann' ~/Mail|wc -l
    150)

Further:

Query: to=ding or to=bbdb
both give the no results message

There are *NO* stop words in the index

Subject queries work:

Query: subject=nnir
Gives instant results from ding group

Query: subject=postal
Gives instant results from bbdb group

Query: subject=give
Gives instant results from both groups
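
A note on the grep cross-checks above: they count matching lines anywhere in a file, so quoted "From:" lines in message bodies can inflate the number. A minimal sketch of a stricter check follows, assuming one message per file (MH style, as the format file says). The function name count_from is hypothetical and not part of nnir or freeWAIS-sf. It looks only at the header block, up to the first blank line, and counts files rather than lines, ignoring case:

```shell
#!/bin/sh
# count_from DIR PATTERN  (hypothetical helper, for illustration only)
# Prints the number of message files under DIR whose From: header
# matches PATTERN, case-insensitively. Only the header block (everything
# up to the first blank line) is examined.
count_from() {
    find "$1" -type f -exec sh -c '
        pattern=$1; shift
        for f in "$@"; do
            # sed prints lines up to the first blank line, then quits,
            # so body text never reaches grep.
            if sed "/^\$/q" "$f" | grep -qi "^From:.*$pattern"; then
                printf "%s\n" "$f"
            fi
        done
    ' sh "$2" {} + | wc -l | tr -d ' '
}
```

Usage would be along the lines of `count_from ~/Mail/ding Kai`. If that still reports matching messages while `from=Kai` returns nothing, the mismatch is more likely in the index or the query than in the mail itself.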