From mboxrd@z Thu Jan 1 00:00:00 1970
From: David Z Maze
Newsgroups: gmane.emacs.gnus.general
Subject: Re: Spam/Ham training
Date: Tue, 02 Dec 2003 13:55:09 -0500
Message-ID: <87d6b7oxrm.fsf@everett.mit.edu>
References: <4nhe0kfdt0.fsf@lockgroove.bwh.harvard.edu>
In-Reply-To: <4nhe0kfdt0.fsf@lockgroove.bwh.harvard.edu> (Ted Zlatanov's message of "Mon, 01 Dec 2003 16:06:35 -0500")
Original-To: Xavier Maillard
Cc: ding@gnus.org
User-Agent: Gnus/5.1003 (Gnus v5.10.3) XEmacs/21.4 (Reasonable Discussion, linux)
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="=-=-="

--=-=-=

Ted Zlatanov writes:

> I am considering adding spam *recognition* when you enter a group,
> which would be useful for NNTP groups which have no splitting.
> Unseen articles would be checked against a blacklist, for instance.

Oh, I already have code to do that; I don't use it for nntp, but I do
use it for another [MIT-local] read-only backend.
--=-=-=
Content-Type: application/emacs-lisp
Content-Disposition: inline

;; Look for spam in discuss groups.
(defun dzm-gnus-find-spam ()
  "Mark as spam each article in the current group that `spam-split' rejects."
  (mapcar
   (lambda (article)
     ;; Fetch the article into a scratch buffer and ask spam-split
     ;; which group it would be filed under.
     (let ((split-group
            (with-temp-buffer
              (gnus-request-article-this-buffer article gnus-newsgroup-name)
              (spam-split))))
       (if (equal split-group spam-split-group)
           (gnus-summary-mark-article article gnus-spam-mark))))
   gnus-newsgroup-articles))

(defun dzm-gnus-nndsc-spam ()
  "Run `dzm-gnus-find-spam' only in groups served by the nndsc backend."
  (if (equal (car (gnus-find-method-for-group gnus-newsgroup-name))
             'nndsc)
      (dzm-gnus-find-spam)))

(add-hook 'gnus-summary-prepare-hook 'dzm-gnus-nndsc-spam)

--=-=-=

...so the core of this, in dzm-gnus-find-spam, downloads each article
in turn into a temporary buffer, calls spam-split, and sees if the
result is spam-split-group; if so, it marks the message as spam.  This
seems to work pretty well in practice.

-- 
David Maze         dmaze@mit.edu      http://www.mit.edu/~dmaze/
"Theoretical politics is interesting.  Politicking should be illegal."
        -- Abra Mitchell

--=-=-=--