From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/46164 Path: main.gmane.org!not-for-mail From: prj@po.cwru.edu (Paul Jarc) Newsgroups: gmane.emacs.gnus.general Subject: Re: Paul Graham on fighting SPAM Date: Mon, 19 Aug 2002 01:44:05 -0400 Organization: What did you have in mind? A short, blunt, human pyramid? Sender: owner-ding@hpc.uh.edu Message-ID: References: NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: quoted-printable X-Trace: main.gmane.org 1029735866 4224 127.0.0.1 (19 Aug 2002 05:44:26 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 19 Aug 2002 05:44:26 +0000 (UTC) Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 17gfKy-00015i-00 for ; Mon, 19 Aug 2002 07:44:24 +0200 Original-Received: from sina.hpc.uh.edu ([129.7.128.10] ident=lists) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 17gfKz-0005uq-00; Mon, 19 Aug 2002 00:44:25 -0500 Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Mon, 19 Aug 2002 00:44:55 -0500 (CDT) Original-Received: from sclp3.sclp.com (qmailr@sclp3.sclp.com [209.196.61.66]) by sina.hpc.uh.edu (8.9.3/8.9.3) with SMTP id AAA21878 for ; Mon, 19 Aug 2002 00:44:45 -0500 (CDT) Original-Received: (qmail 17864 invoked by alias); 19 Aug 2002 05:44:09 -0000 Original-Received: (qmail 17859 invoked from network); 19 Aug 2002 05:44:09 -0000 Original-Received: from multivac.student.cwru.edu (HELO multivac.cwru.edu) (@129.22.96.25) by gnus.org with SMTP; 19 Aug 2002 05:44:09 -0000 Original-Received: (qmail 13228 invoked by uid 500); 19 Aug 2002 05:44:28 -0000 Original-To: ding@gnus.org In-Reply-To: (Kai.Grossjohann@CS.Uni-Dortmund.DE's message of "Sat, 17 Aug 2002 21:43:05 +0200") Mail-Copies-To: nobody Mail-Followup-To: ding@gnus.org Original-Lines: 16 User-Agent: Gnus/5.090007 (Oort Gnus v0.07) Emacs/21.2 (i686-pc-linux-gnu) Precedence: list X-Majordomo: 1.94.jlt7 Xref: main.gmane.org gmane.emacs.gnus.general:46164 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:46164 Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Gro=DFjohann) wrote: > There is a research field known as "information filtering" or > "(automatic) text classification" or "text categorization". I don't > know the details of the theory, but folks in that community are > speaking of "naive Bayes classifiers" as one of the ways to do it -- > maybe that's similar to his approach. Sounds like it. Anyone know if this (or another) method generalizes to more than two categories (spam/nonspam)? If so, it could be used for all mail splitting. We wouldn't have to manually craft split rules; we'd just seed a new group with the mails we have so far that belong there, and their contents would let the computer guess which new mails belong with them. paul