From: Jesper Harder
Newsgroups: gmane.emacs.gnus.general
Subject: Re: spam-stat and base64 encoded messages
Date: Wed, 11 Jun 2003 21:42:36 +0200
Organization: http://purl.org/harder/
References: <03of1dyplo.fsf@msgid.viggen.net> <4nof1cguhz.fsf@holmes.bwh.harvard.edu> <4nn0gv28m3.fsf@lockgroove.bwh.harvard.edu> <0365niyeq8.fsf@msgid.viggen.net> <4n1xy32ey5.fsf@lockgroove.bwh.harvard.edu>
Cc: John Owens

writes:

> On Mon, 09 Jun 2003, harder@myrealbox.com wrote:
>> I think that to make this work correctly, you'll need to parse the
>> MIME structure of the message, and then apply the proper decoding to
>> the appropriate parts.
>
> Hmm, are you sure we need to do full MIME parsing?  That would slow
> down the incoming mail splitting a lot, I would think.  But I don't
> know all the Gnus MIME parsing functionality, or how fast it is.

text/plain is handled specially [see below] (for performance reasons, I
guess) -- so I don't think there's a lot of extra overhead for that
case.

There's some overhead for other MIME types.  But OTOH handling them
better could also make it faster, e.g. there's no reason for
spam-stat.el to waste time analyzing an attached image (which I think
it's doing now).

> John Owens (cc-ed on this) was asking about forwarded spam messages,
> which are inside an envelope from SpamAssassin.  That's another case
> where spam-split or spam-stat-split has to do a lot of parsing.  Maybe
> there's a better way?

Try playing with `mm-dissect-buffer', which returns a tree of the MIME
structure.

> Referring to the decode-if-needed part, is the
> gnus-article-decode-hook going to try decoding content even if it's
> plain text or is there some detection done?

`gnus-article-decode-hook' is for text/plain _only_ (and headers).  So
yes, it will decode plain text.  It's used because text/plain isn't
handled by the normal MIME machinery, as mentioned above.
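
To make the "parse the MIME structure, decode the appropriate parts, and
skip the rest" idea concrete, here is a rough sketch.  It is written in
Python with the standard `email` package purely as an illustration; it is
not Gnus code and says nothing about how `mm-dissect-buffer' or
spam-stat.el actually work.  The names `text_for_analysis` and
`build_sample` are made up for this example.

```python
from email import message_from_bytes, policy
from email.message import EmailMessage
from typing import List


def text_for_analysis(raw: bytes) -> List[str]:
    """Return the decoded text of every text/* leaf part in a raw message.

    Hypothetical helper: it stands in for "walk the MIME tree and only
    hand decoded text to the statistical filter".
    """
    msg = message_from_bytes(raw, policy=policy.default)
    texts = []
    # walk() visits the whole tree depth-first.  It also descends into
    # message/rfc822 parts, so the text of a forwarded message is visited.
    for part in msg.walk():
        if part.is_multipart():
            continue  # container node, no content of its own
        if part.get_content_maintype() != "text":
            continue  # e.g. an attached image: nothing worth tokenizing
        try:
            # Undoes base64/quoted-printable and the declared charset.
            texts.append(part.get_content())
        except (LookupError, UnicodeError):
            # Unknown or wrong charset: fall back to a lossless byte mapping.
            texts.append(part.get_payload(decode=True).decode("latin-1"))
    return texts


def build_sample() -> bytes:
    """Build a small test message: base64 text body plus a fake image."""
    msg = EmailMessage()
    msg["Subject"] = "sample"
    msg.set_content("Buy cheap pills now", cte="base64")
    msg.add_attachment(b"\x89PNG fake", maintype="image", subtype="png")
    return msg.as_bytes()


if __name__ == "__main__":
    for text in text_for_analysis(build_sample()):
        print(text.strip())
```

The point of the sketch is only the shape of the loop: the body text is
invisible in the raw bytes because it is base64-encoded, it becomes
visible after decoding, and the image part is never looked at.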