From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/81722 Path: news.gmane.org!not-for-mail From: Jan Tatarik Newsgroups: gmane.emacs.gnus.general Subject: Re: Scoring on basee64 encoded message body Date: Wed, 11 Apr 2012 21:34:48 +0200 Message-ID: <5n5x2rehrudpuv.fsf@nb-jtatarik2.xing.hh> References: <5n5x2rvcm5zibm.fsf@nb-jtatarik2.xing.hh> <5n5x2r8virwm8j.fsf@nb-jtatarik2.xing.hh> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1334172981 14425 80.91.229.3 (11 Apr 2012 19:36:21 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 11 Apr 2012 19:36:21 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M30001@lists.math.uh.edu Wed Apr 11 21:36:19 2012 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1SI3Ko-0007Mw-Vf for ding-account@gmane.org; Wed, 11 Apr 2012 21:36:07 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1SI3Jl-00044V-S3; Wed, 11 Apr 2012 14:35:01 -0500 Original-Received: from mx2.math.uh.edu ([129.7.128.33]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1SI3Jk-00044L-9s for ding@lists.math.uh.edu; Wed, 11 Apr 2012 14:35:00 -0500 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx2.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.76) (envelope-from ) id 1SI3Jj-0002bT-4z for ding@lists.math.uh.edu; Wed, 11 Apr 2012 14:34:59 -0500 Original-Received: from mail-bk0-f44.google.com ([209.85.214.44]) by quimby.gnus.org with esmtp (Exim 4.72) (envelope-from ) id 1SI3Jh-0006XF-L5 for ding@gnus.org; Wed, 11 Apr 2012 21:34:57 +0200 Original-Received: by bkuw5 with SMTP id w5so1551505bku.17 for ; Wed, 11 Apr 2012 12:34:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:to:subject:references:date:in-reply-to:message-id:user-agent :mime-version:content-type; bh=l4adA5J6r/NVbHUbeHdtN/2+RuScpPaOCpkldragBbw=; b=uDVlAg5K2vg85oD0OngEkwQT/kUyOxbcm/HcMOmywPmvW46eT4g+lhQz0Kjd61wGD5 iD0ABf61zwdI0g5AZ1oOTQ/K0yONsZHVCwjK45SiMeel+Y0lYvqDsgjdMQq4piRdAnlz FtxKpvljL3VhA3xL7dRNmPX+/sTA90cZDMr9DBBhahmRmzYD+2hcskvnph+wSrW5TSyT rSJ2ndp2Jl46pJba5xRlbBpFkFhsPJtl0OhqCb3K1/8cRJQm4dqHKYT+FjhkoMUjKgF7 CAaLWysr0+y/dHcCMBEySCqQ6NhC0BN9tnBqcv6gxRehADXrn62bubuPB/ed24ehW+VZ FWug== Original-Received: by 10.204.151.198 with SMTP id d6mr6266788bkw.122.1334172892224; Wed, 11 Apr 2012 12:34:52 -0700 (PDT) Original-Received: from nb-jtatarik2.xing.hh (e183236102.adsl.alicedsl.de. [85.183.236.102]) by mx.google.com with ESMTPS id u5sm6712230bka.5.2012.04.11.12.34.49 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 11 Apr 2012 12:34:50 -0700 (PDT) In-Reply-To: (Lars Magne Ingebrigtsen's message of "Tue, 10 Apr 2012 21:32:08 +0200") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.1.50 (gnu/linux) X-Spam-Score: -3.0 (---) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:81722 Archived-At: On Tue, Apr 10 2012, Lars Magne Ingebrigtsen Lars Magne Ingebrigtsen wrote: > Yes. And I almost applied it, but then I remembered that this > wouldn't, in general, really give us what we wanted, anyway. :-/ > That is, a lot of messages are MIME multipart messages. If we start > decoding, we really should decode all the parts, and score on the > decoded text of them all, I think. All text/* parts, I assume? I had a quick look at mm handling in gnus. My idea would be to - run mm-dissect-buffer on the message body (any idea which args would be appropriate?) - for multipart messages, pick the handles with text/* type, run them through their respective mm-inline-* function as defined in mm-inline-media-tests - score on all the decoded text parts Is this the way to go?