From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/46217 Path: main.gmane.org!not-for-mail From: Alex Schroeder Newsgroups: gmane.emacs.gnus.general Subject: Re: Paul Graham on fighting SPAM Date: Thu, 22 Aug 2002 21:59:27 +0200 Sender: owner-ding@hpc.uh.edu Message-ID: <87ofbuzmu8.fsf@emacswiki.org> References: <87d6sf42ys.fsf@emacswiki.org> <871y8u7un8.fsf@emacswiki.org> <87fzxa7ala.fsf@emacswiki.org> <87d6se9dsy.fsf@emacswiki.org> <87vg637hvh.fsf@emacswiki.org> NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: quoted-printable X-Trace: main.gmane.org 1030046315 28571 127.0.0.1 (22 Aug 2002 19:58:35 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Thu, 22 Aug 2002 19:58:35 +0000 (UTC) Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 17hy6D-0007Qh-00 for ; Thu, 22 Aug 2002 21:58:33 +0200 Original-Received: from sina.hpc.uh.edu ([129.7.128.10] ident=lists) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 17hy6P-0002CF-00; Thu, 22 Aug 2002 14:58:45 -0500 Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Thu, 22 Aug 2002 14:59:16 -0500 (CDT) Original-Received: from sclp3.sclp.com (qmailr@sclp3.sclp.com [209.196.61.66]) by sina.hpc.uh.edu (8.9.3/8.9.3) with SMTP id OAA01179 for ; Thu, 22 Aug 2002 14:59:03 -0500 (CDT) Original-Received: (qmail 23772 invoked by alias); 22 Aug 2002 19:58:26 -0000 Original-Received: (qmail 23767 invoked from network); 22 Aug 2002 19:58:26 -0000 Original-Received: from relay03.cablecom.net (62.2.33.103) by gnus.org with SMTP; 22 Aug 2002 19:58:26 -0000 Original-Received: from smtp.swissonline.ch (mail-4.swissonline.ch [62.2.32.85]) by relay03.cablecom.net (8.12.5/8.12.5/SOL/AWF/MXRELAY/20020820) with ESMTP id g7MJvqMO059407 for ; Thu, 22 Aug 2002 21:57:58 +0200 (CEST) (envelope-from alex@emacswiki.org) Original-Received: from confusibombus (dclient217-162-239-43.hispeed.ch [217.162.239.43]) by smtp.swissonline.ch (8.11.6/8.11.6/SMTPSOL/AWF/2002040101) with ESMTP id g7MJvpH23197 for ; Thu, 22 Aug 2002 21:57:52 +0200 (MEST) Original-Received: from alex by confusibombus with local (Exim 3.35 #1 (Debian)) id 17hy75-00005J-00 for ; Thu, 22 Aug 2002 21:59:27 +0200 Original-To: ding@gnus.org X-Face: ^BC$`[IcggstLPyen&dqF+b2'zyK#r.mU*'Nms}@&4zw%SJ#5!/7SMVjBS7'lb;QK)|IPU5U'o1'522W4TyzB3Ab*IBo^iw]l4|kUbdZuUDO6=Um-.4IzhNiV'B"@K#jy_(wW|Zbk[34flKY^|PrQ?$u2\fKg^]AY>wOX#H32i In-Reply-To: (Kai.Grossjohann@CS.Uni-Dortmund.DE's message of "Thu, 22 Aug 2002 19:57:19 +0200") Original-Lines: 20 User-Agent: Gnus/5.090008 (Oort Gnus v0.08) Emacs/21.2.90 (i686-pc-linux-gnu) Precedence: list X-Majordomo: 1.94.jlt7 Xref: main.gmane.org gmane.emacs.gnus.general:46217 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:46217 Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Gro=DFjohann) writes: > Maybe the folks over at comp.theory.info-retrieval know more about > this? There should be some experts hanging out there. Hehe, thought about this as well. After all, the new message was considered to be spam, so spam-stat might as well learn about the new words in it. If the user then changes his mind, I have functions to deal with that. What do you say? After all, the message is *already* in mail.spam, so to answer "yes" means to learn as we go, let the user change the scores if he wants; to answer "no" means that we only learn after the user has confirmed the decisions spam-stat made. If we were talking about a neural network, then there'd be the question of overtraining, going of on tangents, and no way to undo it. But all spam-stat knows is "how often was this word in the good and the bad mails?" and "how many good and bad mails are there?" -- that is easy to undo. Lucky us. :) Alex.