From mboxrd@z Thu Jan 1 00:00:00 1970
From: Alex Schroeder
Newsgroups: gmane.emacs.gnus.general
Subject: Re: spam.el is a bit aggressive loading/saving spam-stat data
Date: Sat, 08 Mar 2003 01:49:10 +0100
Message-ID: <87u1eewum1.fsf@gnu.org>
References: <4nd6ll3bus.fsf@lockgroove.bwh.harvard.edu>
 <4nk7ft1hqj.fsf@lockgroove.bwh.harvard.edu>
 <874r6xtccq.fsf@emacswiki.org>
 <4nisv91h2t.fsf@chubby.bwh.harvard.edu>
 <4nel5u3frk.fsf@chubby.bwh.harvard.edu>
 <4nisuw5wsm.fsf@lockgroove.bwh.harvard.edu>
 <4nadg7446p.fsf@surf.bwh.harvard.edu>
 <87y93qu6qd.fsf@jeeves.blindglobe.net>
Mail-Followup-To: ding@gnus.org
User-Agent: Gnus/5.090016 (Oort Gnus v0.16) Emacs/21.2.92

rossini@blindglobe.net (A.J. Rossini) writes:

> Actually, that generates an interesting statistical question -- how to
> estimate the temporal window/down-weighting of scores to adaptively
> optimize sensitivity/specificity in a time-heterogeneous setting,
> within the context of a receiver-operating-characteristic (ROC)
> curve...

Heh.  Whatever.  :)

It works for me because I use only spam-stat.el -- no spam.el!
I have a few thousand mails in mail.misc and mail.spam -- and should I
ever feel that I have too much (e.g. more than a year's worth of spam),
then I can just delete it manually.  Every now and then I delete my
dictionary and run an Emacs just to recompute it.

The ~/.spam-stat.el file is currently 447272 bytes and covers 28143
words, 5290 non-spam mails, and 466 spam mails.  It works very well.

At work I have over 4000 spam mails and a much smaller number of
non-spam mails.  A lot of spam is not caught there.

Alex.