From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/55970 Path: main.gmane.org!not-for-mail From: Ted Zlatanov Newsgroups: gmane.emacs.gnus.general Subject: Re: spam-stat.el and mime Date: Mon, 12 Jan 2004 16:37:28 -0500 Organization: =?koi8-r?q?=F4=C5=CF=C4=CF=D2=20=FA=CC=C1=D4=C1=CE=CF=D7?= @ Cienfuegos Sender: ding-owner@lists.math.uh.edu Message-ID: <4n8ykcrgpz.fsf@collins.bwh.harvard.edu> References: <87u133g3f4.fsf@andy.bu.edu> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1073943570 15320 80.91.224.253 (12 Jan 2004 21:39:30 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 12 Jan 2004 21:39:30 +0000 (UTC) Cc: ding@gnus.org Original-X-From: ding-owner+M4510@lists.math.uh.edu Mon Jan 12 22:39:23 2004 Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1Ag9mN-00049W-00 for ; Mon, 12 Jan 2004 22:39:23 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 1Ag9mG-0007MU-00; Mon, 12 Jan 2004 15:39:16 -0600 Original-Received: from justine.libertine.org ([66.139.78.221] ident=postfix) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 1Ag9mB-0007MP-00 for ding@lists.math.uh.edu; Mon, 12 Jan 2004 15:39:11 -0600 Original-Received: from clifford.bwh.harvard.edu (clifford.bwh.harvard.edu [134.174.9.41]) by justine.libertine.org (Postfix) with ESMTP id 6262E3A0033 for ; Mon, 12 Jan 2004 15:39:11 -0600 (CST) Original-Received: from collins.bwh.harvard.edu (collins [134.174.9.80]) by clifford.bwh.harvard.edu (8.10.2+Sun/8.11.0) with ESMTP id i0CLbXW07953; Mon, 12 Jan 2004 16:37:33 -0500 (EST) Original-Received: from collins.bwh.harvard.edu (localhost [127.0.0.1]) by collins.bwh.harvard.edu (8.12.9+Sun/8.11.0) with ESMTP id i0CLbSuB022269; Mon, 12 Jan 2004 16:37:28 -0500 (EST) Original-Received: (from tzz@localhost) by collins.bwh.harvard.edu (8.12.9+Sun/8.12.9/Submit) id i0CLbSAI022266; Mon, 12 Jan 2004 16:37:28 -0500 (EST) Original-To: Andrew Cohen X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6;d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" Mail-Followup-To: Andrew Cohen , ding@gnus.org In-Reply-To: <87u133g3f4.fsf@andy.bu.edu> (Andrew Cohen's message of "Sat, 10 Jan 2004 11:43:27 -0500") User-Agent: Gnus/5.110002 (No Gnus v0.2) Emacs/21.3.50 (usg-unix-v) Precedence: bulk Xref: main.gmane.org gmane.emacs.gnus.general:55970 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:55970 On Sat, 10 Jan 2004, cohen@andy.bu.edu wrote: > I've been using spam-stat.el for ages, but was unhappy that it only > had a success rate of about 97%. Checking a bit this was almost > entirely because it did no decoding of mime (or base64) encoded > articles. I've modified it to decode mime (if you don't like this it > can be controlled by customizing the spam-treat-mime-function to > nil). > > After retraining, I now have a false-positive rate of less than .08% > (no false positives on my test directory of 1300 ham emails) and a > success rate of detecting spam of about 99.8%, which is as good or > better than any of the other Bayesian filters I've played with. That's very cool, but shouldn't it go into spam.el so other backends besides spam-stat can use it? If you can incorporate Jesper's suggestions into your code, you're welcome to make a patch to put the code in spam.el or let me do it. This is definitely a good feature for the Gnus users. Thanks Ted