From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/54852 Path: main.gmane.org!not-for-mail From: Ted Zlatanov Newsgroups: gmane.emacs.gnus.general Subject: Re: spam.el: automatically resplitting ham in a spam group? Date: Wed, 19 Nov 2003 16:24:30 -0500 Organization: =?koi8-r?q?=F4=C5=CF=C4=CF=D2=20=FA=CC=C1=D4=C1=CE=CF=D7?= @ Cienfuegos Sender: ding-owner@lists.math.uh.edu Message-ID: <4nu150car5.fsf@lockgroove.bwh.harvard.edu> References: <4nfzgkdqzx.fsf@lockgroove.bwh.harvard.edu> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1069277147 14592 80.91.224.253 (19 Nov 2003 21:25:47 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 19 Nov 2003 21:25:47 +0000 (UTC) Cc: ding@gnus.org Original-X-From: ding-owner+M3392=ding+2Daccount=gmane.org@lists.math.uh.edu Wed Nov 19 22:25:44 2003 Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1AMZpY-0005n3-00 for ; Wed, 19 Nov 2003 22:25:44 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 1AMZpX-00064S-02 for ding-account@gmane.org; Wed, 19 Nov 2003 15:25:43 -0600 Original-Received: from justine.libertine.org ([66.139.78.221] ident=postfix) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 1AMZpS-00064L-00 for ding@lists.math.uh.edu; Wed, 19 Nov 2003 15:25:38 -0600 Original-Received: from clifford.bwh.harvard.edu (clifford.bwh.harvard.edu [134.174.9.41]) by justine.libertine.org (Postfix) with ESMTP id 33CF33A0047 for ; Wed, 19 Nov 2003 15:25:38 -0600 (CST) Original-Received: from lockgroove.bwh.harvard.edu (lockgroove [134.174.9.133]) by clifford.bwh.harvard.edu (8.10.2+Sun/8.11.0) with ESMTP id hAJLOx703918; Wed, 19 Nov 2003 16:24:59 -0500 (EST) Original-Received: (from tzz@localhost) by lockgroove.bwh.harvard.edu (8.11.6+Sun/8.11.0) id hAJLOUr04120; Wed, 19 Nov 2003 16:24:30 -0500 (EST) Original-To: Jody Klymak X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6;d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" Mail-Followup-To: Jody Klymak , ding@gnus.org In-Reply-To: (Jody Klymak's message of "Wed, 19 Nov 2003 12:57:52 -0800") User-Agent: Gnus/5.1003 (Gnus v5.10.3) Emacs/21.3.50 (usg-unix-v) Precedence: bulk Xref: main.gmane.org gmane.emacs.gnus.general:54852 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:54852 On Wed, 19 Nov 2003, jklymak@coas.oregonstate.edu wrote: > Hello Ted, > > Ted Zlatanov writes: > >> By the way, recently I added multiple >> spam/ham-process-destinations. So now from a spam group, you can >> send ham to "INBOX" and to a "train_ham" group. I find that very >> useful for training, since "train_ham" will only contain the >> misclassified messages. With your setup, you may like it too. > > What is the rationale behind this? Do you do the training on the > mail server? Yes. > I train locally using bogofilter, and have found simply training on > misclassified messages to work quite well. ie. only train on spam > in my ham group and only on ham in my spam group. So I have been > curious about these train_ham and train_spam groups. Correct, for local training bogofilter is fine. I can't use it with my ISP, though. A change that's coming is that I'll make all the registration functions batch-oriented - so instead of registering one message at a time, all of them can be registered at once. That should speed things up enormously, and it should make remote training possible (you can save the articles in a file and send it over via ssh or whatever mechanism you like). Ted