From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org X-Spam-Level: X-Spam-Status: No, score=-2.1 required=5.0 tests=DKIM_INVALID,DKIM_SIGNED, RCVD_IN_DNSWL_MED,RCVD_IN_MSPIKE_H3,RCVD_IN_MSPIKE_WL autolearn=ham autolearn_force=no version=3.4.4 Received: (qmail 7953 invoked from network); 22 Mar 2023 14:37:32 -0000 Received: from mx1.math.uh.edu (129.7.128.32) by inbox.vuxu.org with ESMTPUTF8; 22 Mar 2023 14:37:32 -0000 Received: from lists1.math.uh.edu ([129.7.128.208]) by mx1.math.uh.edu with esmtps (TLS1.3) tls TLS_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1pezal-001F0m-6i for ml@inbox.vuxu.org; Wed, 22 Mar 2023 09:37:31 -0500 Received: from lists1.math.uh.edu ([127.0.0.1] helo=lists.math.uh.edu) by lists1.math.uh.edu with smtp (Exim 4.96) (envelope-from ) id 1pezak-001wAO-30 for ml@inbox.vuxu.org; Wed, 22 Mar 2023 09:37:31 -0500 Received: from mx2.math.uh.edu ([129.7.128.33]) by lists1.math.uh.edu with esmtp (Exim 4.96) (envelope-from ) id 1pezai-001wAI-2C for ding@lists.math.uh.edu; Wed, 22 Mar 2023 09:37:28 -0500 Received: from quimby.gnus.org ([95.216.78.240]) by mx2.math.uh.edu with esmtps (TLS1.3) tls TLS_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1pezag-00C340-Rj for ding@lists.math.uh.edu; Wed, 22 Mar 2023 09:37:28 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:Date:References: In-Reply-To:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=qg18iryny/Tpt5FhkN92xDOFVslc7VYpFYe4iuM2Kyc=; b=Hz15ftL3ohSek8Z04gmYtGFkHL +wPQWg1cglTJKc8Si7/7heLpFwxuF+Ld9cML3lnuDD4xWlMm29FSGahEmEvq9H4883Ctks1lczbdQ wEze/2JtMdrPNLdNMwnJPY56G6nzda1ZIq+DTWvm+r04AmDkj55zemrUMipbKCSHdR44=; Received: from cp06.nordicway.dk ([148.251.244.167]) by quimby.gnus.org with esmtps (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1pezaZ-0003Ob-Gs for ding@gnus.org; Wed, 22 Mar 2023 15:37:21 +0100 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=defun.dk; s=default; h=Content-Type:MIME-Version:Message-ID:Date:References:In-Reply-To :Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=qg18iryny/Tpt5FhkN92xDOFVslc7VYpFYe4iuM2Kyc=; b=qVvCqANJPcLBuGVRjQTZ8CqmUZ /8hey+jSvo9g6aCLMBflXV4JqpyyrICK0/fZ2GpKLtRJjQNyQEQgug7AaMi+pd4RUZ9hoxbNey3Xc kNQsqV1MSk01FGJSttHsEI5DDc0gQ00VW+zsEo0intdVKcrG33mk1qmth25XLSjxXfvkxxqAz61rY Q9NysCXDgFMz+OLSPJLEA3p4gtqGkwlP1s/5gSypaKCPxlyW9tqbrR0EeTE36MIx7k9D5vyOWpI1s Mg+PzQw6ICdF8wWX3fZBjA5rqxMZzDXPW0WIcEALzwiHdNs64eKE7iQkGHvWrXVoBgZB1xqtQ3jLX 5P8ft3ag==; Received: from 5.186.54.212.static.fibianet.dk ([5.186.54.212]:50891 helo=iMac) by cp06.nordicway.dk with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.95) (envelope-from ) id 1pezaZ-005gO6-7a; Wed, 22 Mar 2023 15:37:17 +0100 From: Christian Lynbech To: Andrew Cohen Cc: ding@gnus.org Subject: Re: Handling spam In-Reply-To: <87v8itp5nm.fsf@ust.hk> (Andrew Cohen's message of "Wed, 22 Mar 2023 16:11:09 +0800") References: <87v8itp5nm.fsf@ust.hk> Date: Wed, 22 Mar 2023 15:37:17 +0100 Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) MIME-Version: 1.0 Content-Type: text/plain X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - cp06.nordicway.dk X-AntiAbuse: Original Domain - gnus.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - defun.dk X-Get-Message-Sender-Via: cp06.nordicway.dk: authenticated_id: christian@defun.dk X-Authenticated-Sender: cp06.nordicway.dk: christian@defun.dk X-Source: X-Source-Args: X-Source-Dir: List-ID: Precedence: bulk Thanks for the hint, I guess this is then shifting the analysis to work on the formatted message rather than the raw. It will actually not be hard to change my workflow to work on the formatted message (all uncaught spam is manually moved to a separate folder which I then process with a command I have written myself, this command takes care to go to the raw message but I can just not do that). /Christian ----------------------- On Wed, Mar 22 2023, Andrew Cohen wrote: >>>>> "CL" == Christian Lynbech writes: CL> Does any of you use gnus to handle spam, and if so, how do you CL> do it? I have for quite some time been using the spam-stat CL> library that is bundled with emacs, but it is not working so CL> well for me. [...] CL> So I am not sure what to do, either I need to teach the split CL> rule to look at the raw message or I need to retrain my spam CL> detection on formatted messages, something I can certainly do CL> but which perhaps is less efficient in distinguishing between CL> spam and non-spam. Certainly, being able to quickly process CL> whole directories is rather convenient. You need to look at 'spam-stat-washing-hook: "Hook applied to each message before analysis." With this you can manipulate the article before the spam analysis. A common choice is (require 'spam-wash) (add-hook 'spam-stat-washing-hook 'spam-wash) which will decode MIME encodings before doing the spam analysis. I don't recall if it deals with html email but you should be able to modify the function 'spam-wash easily enough to do what you want. Best, Andy -- Andrew Cohen