From mboxrd@z Thu Jan 1 00:00:00 1970
X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/49004
Path: main.gmane.org!not-for-mail
From: Alex Schroeder
Newsgroups: gmane.emacs.gnus.general
Subject: Re: Getting started with spam filtering
Date: Thu, 09 Jan 2003 19:38:46 +0100
Sender: owner-ding@hpc.uh.edu
Message-ID: <87r8bmjhhl.fsf@emacswiki.org>
References: <84u1gkfa84.fsf@lucy.cs.uni-dortmund.de> <4n65szll2k.fsf@lockgroove.bwh.harvard.edu> <87fzs3p628.fsf@emacswiki.org>
NNTP-Posting-Host: main.gmane.org
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-Trace: main.gmane.org 1042137524 14027 80.91.224.249 (9 Jan 2003 18:38:44 GMT)
X-Complaints-To: usenet@main.gmane.org
NNTP-Posting-Date: Thu, 9 Jan 2003 18:38:44 +0000 (UTC)
Return-path:
Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 18WhZi-0003e1-00 for ; Thu, 09 Jan 2003 19:38:42 +0100
Original-Received: from sina.hpc.uh.edu ([129.7.128.10] ident=lists) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 18Wha7-0007gH-00; Thu, 09 Jan 2003 12:39:07 -0600
Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Thu, 09 Jan 2003 12:40:02 -0600 (CST)
Original-Received: from sclp3.sclp.com (sclp3.sclp.com [66.230.238.2]) by sina.hpc.uh.edu (8.9.3/8.9.3) with SMTP id MAA18945 for ; Thu, 9 Jan 2003 12:39:49 -0600 (CST)
Original-Received: (qmail 30012 invoked by alias); 9 Jan 2003 18:38:50 -0000
Original-Received: (qmail 30007 invoked from network); 9 Jan 2003 18:38:49 -0000
Original-Received: from quimby.gnus.org (80.91.224.244) by 66.230.238.6 with SMTP; 9 Jan 2003 18:38:49 -0000
Original-Received: from news by quimby.gnus.org with local (Exim 3.12 #1 (Debian)) id 18WheM-0004rW-00 for ; Thu, 09 Jan 2003 19:43:30 +0100
Original-To: ding@gnus.org
Original-Path: not-for-mail
Original-Newsgroups: gnus.ding
Original-Lines: 53
Original-NNTP-Posting-Host: dclient217-162-234-9.hispeed.ch
Original-X-Trace: quimby.gnus.org 1042137810 18693 217.162.234.9 (9 Jan 2003 18:43:30 GMT)
Original-X-Complaints-To: usenet@quimby.gnus.org
Original-NNTP-Posting-Date: 9 Jan 2003 18:43:30 GMT
X-Face: ^BC$`[IcggstLPyen&dqF+b2'zyK#r.mU*'Nms}@&4zw%SJ#5!/7SMVjBS7'lb;QK)|IPU5U'o1'522W4TyzB3Ab*IBo^iw]l4|kUbdZuUDO6=Um-.4IzhNiV'B"@K#jy_(wW|Zbk[34flKY^|PrQ?$u2\fKg^]AY>wOX#H32i
User-Agent: Gnus/5.090011 (Oort Gnus v0.11) Emacs/21.2.92 (i686-pc-linux-gnu)
Cancel-Lock: sha1:lJ31V86KxuS1mH6G+4HbeTf4QiU=
Precedence: list
X-Majordomo: 1.94.jlt7
Xref: main.gmane.org gmane.emacs.gnus.general:49004
X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:49004

Jorge Godoy writes:

> Alex Schroeder writes:
>
>> Just for the record -- I use spam-stat.el without spam.el at the
>> moment, and it works just fine.  So if anybody has questions about how
>> to get started, just ask and I will do my best, much as Ted is doing
>> for spam.el.  :)
>
> It would be interesting if you could post a step-by-step on how to get
> it running.  I can publish it on a website for you (or you can add it
> to Emacs Wiki :-))

You can start by looking in the Gnus manual node "Filtering Spam Using
Statistics (spam-stat.el)".  The node "Creating a spam-stat
dictionary" explains how to create your dictionary.  The node
"Splitting mail using spam-stat" explains how to change your mail
splitting setup.

The short version is this:

You need two nnml directories, one with spam, one with non-spam mails,
before you start!

1. Call `spam-stat-process-spam-directory' on `~/Mail/mail/spam'.
2. Call `spam-stat-process-non-spam-directory' on `~/Mail/mail/misc'.
3. Call `spam-stat-save' to save the dictionary.
4. Add this to your `~/.gnus' file:

    (require 'spam-stat)
    (spam-stat-load)

5. Change your mail splitting following one of these examples:

    (setq nnmail-split-fancy
          `(| (: spam-stat-split-fancy)
              "mail.misc"))

    (setq nnmail-split-fancy
          `(| ("Content-Type" "text/html" "mail.spam.filtered")
              (: spam-stat-split-fancy)
              ("Subject" "\\bspam-stat\\b" "mail.emacs")
              "mail.misc"))

I am grateful for any holes poked in the manual section on
spam-stat.el -- only then can I improve it.  I just noticed, for
example, that the manual section that tells you to call spam-stat-save
is not formatted correctly...

Alex.
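
P.S. For anyone who prefers evaluating forms to calling the commands
interactively, steps 1 to 3 could be sketched roughly as below.  This
is only a sketch: it assumes the two nnml directories named above
already exist and are sorted, and it is meant to be evaluated once
(in *scratch*, say) rather than added to `~/.gnus'.  The
`expand-file-name' calls are an addition for safety, not something
the manual asks for.

```elisp
;; One-time dictionary build -- evaluate by hand, do not put in ~/.gnus.
;; The directory names are the example paths from the steps above;
;; adjust them to wherever your sorted spam and non-spam mail lives.
(require 'spam-stat)

;; Step 1: learn word statistics from known spam.
(spam-stat-process-spam-directory
 (expand-file-name "~/Mail/mail/spam"))

;; Step 2: learn word statistics from known good mail.
(spam-stat-process-non-spam-directory
 (expand-file-name "~/Mail/mail/misc"))

;; Step 3: write the dictionary to disk so that `spam-stat-load'
;; in ~/.gnus can pick it up in later sessions.
(spam-stat-save)
```

Processing large directories can take a while.  Steps 4 and 5 still
go in `~/.gnus' as shown.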