From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/45050 Path: main.gmane.org!not-for-mail From: Russ Allbery Newsgroups: gmane.emacs.gnus.general Subject: Re: nnml splitting on encoded headers Date: Thu, 30 May 2002 15:21:06 -0700 Organization: The Eyrie Sender: owner-ding@hpc.uh.edu Message-ID: References: NNTP-Posting-Host: localhost.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1022797316 4703 127.0.0.1 (30 May 2002 22:21:56 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Thu, 30 May 2002 22:21:56 +0000 (UTC) Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.33 #1 (Debian)) id 17DYIu-0001Dk-00 for ; Fri, 31 May 2002 00:21:56 +0200 Original-Received: from sina.hpc.uh.edu ([129.7.128.10] ident=lists) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 17DYIX-0003pL-00; Thu, 30 May 2002 17:21:33 -0500 Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Thu, 30 May 2002 17:21:47 -0500 (CDT) Original-Received: from sclp3.sclp.com (qmailr@sclp3.sclp.com [209.196.61.66]) by sina.hpc.uh.edu (8.9.3/8.9.3) with SMTP id RAA21659 for ; Thu, 30 May 2002 17:21:37 -0500 (CDT) Original-Received: (qmail 12947 invoked by alias); 30 May 2002 22:21:17 -0000 Original-Received: (qmail 12942 invoked from network); 30 May 2002 22:21:17 -0000 Original-Received: from windlord.stanford.edu (171.64.13.23) by gnus.org with SMTP; 30 May 2002 22:21:17 -0000 Original-Received: (qmail 7233 invoked by uid 50); 30 May 2002 22:21:06 -0000 Original-To: ding@gnus.org In-Reply-To: (Mark Thomas's message of "Sun, 26 May 2002 08:32:10 -0400") Original-Lines: 33 User-Agent: Gnus/5.090005 (Oort Gnus v0.05) XEmacs/21.4 (Common Lisp, sparc-sun-solaris2.6) Precedence: list X-Majordomo: 1.94.jlt7 Xref: main.gmane.org gmane.emacs.gnus.general:45050 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:45050 Mark Thomas writes: > However, Gnus decodes the headers before running the split rules. Check > out this ChangeLog entry: > 2002-01-26 Lars Magne Ingebrigtsen > * nnmail.el (nnmail-article-group): Decode headers before running > split rules over them. > (nnmail-mail-splitting-charset): New variable. > and this snippet of code from nnmail-article-group: > ;; Decode MIME headers and charsets. > (let ((mail-parse-charset nnmail-mail-splitting-charset)) > (mail-decode-encoded-word-region (point-min) (point-max))) Um, that's an extremely serious bug for me. That means I can't upgrade to any newer version of Gnus unless there's some way to turn this off, as far and away the most successful spam filtering rules that I have are those that catch irregularities of the original, encoded or untagged 8-bit Subject line. Telling Gnus to re-encode before split rules apply won't cut it, I believe, unless that re-encoding leaves raw 8-bit that was originally in the Subject header alone. So... how do I turn this feature off? I can understand how this would be useful for people who can read other character sets, so I don't want to see it removed entirely, but it's a serious problem for me. -- Russ Allbery (rra@stanford.edu)