From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/46199 Path: main.gmane.org!not-for-mail From: (Chris Beggy ) news@kippona.com Newsgroups: gmane.emacs.gnus.general Subject: Re: Paul Graham on fighting SPAM Date: Tue, 20 Aug 2002 21:14:26 -0400 Organization: Kippona Sender: owner-ding@hpc.uh.edu Message-ID: <87ptwdxbbh.fsf@lackawana.kippona.com> References: Reply-To: Chris Beggy NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1029892580 26377 127.0.0.1 (21 Aug 2002 01:16:20 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 21 Aug 2002 01:16:20 +0000 (UTC) Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 17hK6c-0006rK-00 for ; Wed, 21 Aug 2002 03:16:18 +0200 Original-Received: from sina.hpc.uh.edu ([129.7.128.10] ident=lists) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 17hK5G-0002QX-00; Tue, 20 Aug 2002 20:14:54 -0500 Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Tue, 20 Aug 2002 20:15:27 -0500 (CDT) Original-Received: from sclp3.sclp.com (qmailr@sclp3.sclp.com [209.196.61.66]) by sina.hpc.uh.edu (8.9.3/8.9.3) with SMTP id UAA27459 for ; Tue, 20 Aug 2002 20:15:14 -0500 (CDT) Original-Received: (qmail 28290 invoked by alias); 21 Aug 2002 01:14:32 -0000 Original-Received: (qmail 28284 invoked from network); 21 Aug 2002 01:14:32 -0000 Original-Received: from lackawana.kippona.com (root@207.8.195.148) by gnus.org with SMTP; 21 Aug 2002 01:14:32 -0000 Original-Received: from lackawana.kippona.com (nobody@localhost [127.0.0.1]) by lackawana.kippona.com (8.12.4/8.12.4/Kippona) with ESMTP id g7L1EUQf030525 (version=TLSv1/SSLv3 cipher=EDH-RSA-DES-CBC3-SHA bits=168 verify=NO) for ; Tue, 20 Aug 2002 21:14:30 -0400 Original-Received: (from nobody@localhost) by lackawana.kippona.com (8.12.4/8.12.4/Submit/Kippona) id g7L1EUe5030524; Tue, 20 Aug 2002 21:14:30 -0400 X-Reply-To: Chris Beggy Original-Lines: 24 Original-X-Trace: lackawana.kippona.com 1029892467 30129 207.8.195.148 (21 Aug 2002 01:14:27 GMT) Original-X-Complaints-To: abuse@kippona.com X-gpgkeyid: 0x8060510A X-fingerprint: 6012 F8F8 29B3 67E4 0604 BCD2 F882 88AE 8060 510A Cancel-Lock: sha1:P70UxZoYmRMlsktXjhg4QjECyoc= Original-To: ding@gnus.org Precedence: list X-Majordomo: 1.94.jlt7 Xref: main.gmane.org gmane.emacs.gnus.general:46199 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:46199 Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann) writes: > prj@po.cwru.edu (Paul Jarc) writes: > >> Sounds like it. Anyone know if this (or another) method generalizes >> to more than two categories (spam/nonspam)? If so, it could be used >> for all mail splitting. We wouldn't have to manually craft split >> rules; we'd just seed a new group with the mails we have so far that >> belong there, and their contents would let the computer guess which >> new mails belong with them. > > I think you can assume that text classifiers can choose one out of N > categories. In some cases, this is implemented by having N yes/no > classifiers and choosing the result from the one with the highest > confidence value, but this is only an implementation detail. > > Using automatic classification for splitting would really be way > cool. Are there any mail readers which do this? Mew does this with its mew-refile-guess-* functions, lists, and variables. It's not strictly automatic, but remembers and learns from previous refiles (splitting, in gnusspeak.) Chris