From: Alex Schroeder
Newsgroups: gmane.emacs.gnus.general
Subject: Re: Paul Graham on fighting SPAM
Date: Mon, 26 Aug 2002 23:55:46 +0200
Message-ID: <87wuqd5lp9.fsf@emacswiki.org>

Anyway, what shall we do with spam-stat.el now?

An ifile user suggested I write code to reduce the dictionary size again. Perhaps I should remove all words occurring fewer than 5 times and all words whose spaminess is close to 0.5 (common words occurring in both spam and non-spam), and analyze only the first few KB of each mail.

Maybe I should write a sample usage that updates the dictionary whenever you move mails into or out of the mail.spam group. I know this is not what Teodor Zlatanov has in mind, but at least I think I could do it myself.

Has anybody used spam-stat.el at all? I must confess that I have not yet added it to the fancy split rules myself, so perhaps we should start using it before we start improving it.

Or should I just assign the copyright of spam-stat.el, so that we can move it into spam.el or whatever, and people will fix it as spam.el gets wider usage?

Alex.
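
To make the dictionary-pruning idea above concrete, here is a rough sketch in Python rather than Emacs Lisp. It is not spam-stat.el code and does not reflect its actual data structures: `WordStats`, `prune`, `head`, the 0.1 "neutral band" around 0.5, and the 4096-character cutoff are all assumptions chosen for illustration. Only the threshold of 5 occurrences comes from the suggestion itself.

```python
from typing import Dict, NamedTuple


class WordStats(NamedTuple):
    """Hypothetical per-word entry; not the spam-stat.el representation."""
    good: int     # occurrences in non-spam mail
    bad: int      # occurrences in spam mail
    score: float  # estimated spam probability, 0.0 to 1.0


def prune(dictionary: Dict[str, WordStats],
          min_count: int = 5,
          neutral_band: float = 0.1) -> Dict[str, WordStats]:
    """Return a copy without rare words and without near-neutral words."""
    kept = {}
    for word, stats in dictionary.items():
        if stats.good + stats.bad < min_count:
            continue  # seen fewer than min_count times in total
        if abs(stats.score - 0.5) < neutral_band:
            continue  # about as common in spam as in non-spam
        kept[word] = stats
    return kept


def head(text: str, limit: int = 4096) -> str:
    """Keep only the first `limit` characters of a mail for analysis."""
    return text[:limit]


sample = {
    "viagra": WordStats(good=0, bad=20, score=0.99),
    "the": WordStats(good=100, bad=100, score=0.5),
    "rare": WordStats(good=1, bad=2, score=0.9),
    "meeting": WordStats(good=30, bad=1, score=0.05),
}
print(sorted(prune(sample)))  # ['meeting', 'viagra']
```

Whether "occurring fewer than 5 times" should count spam and non-spam occurrences together, as done here, or separately is a design choice the suggestion leaves open. The cutoff in `head` counts characters, which only approximates "a few KB" for non-ASCII mail.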