From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/81583 Path: news.gmane.org!not-for-mail From: Lars Magne Ingebrigtsen Newsgroups: gmane.emacs.gnus.general Subject: Re: Scoring on basee64 encoded message body Date: Wed, 14 Mar 2012 15:38:38 +0100 Message-ID: References: NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1331735951 13086 80.91.229.3 (14 Mar 2012 14:39:11 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 14 Mar 2012 14:39:11 +0000 (UTC) Cc: ding@lists.math.uh.edu To: Jan Tatarik Original-X-From: ding-owner+M29863@lists.math.uh.edu Wed Mar 14 15:39:08 2012 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1S7pLw-0000J1-CG for ding-account@gmane.org; Wed, 14 Mar 2012 15:39:00 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1S7pLt-0004DE-9z; Wed, 14 Mar 2012 09:38:57 -0500 Original-Received: from mx1.math.uh.edu ([129.7.128.32]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1S7pLs-0004D3-3e for ding@lists.math.uh.edu; Wed, 14 Mar 2012 09:38:56 -0500 Original-Received: from hermes.netfonds.no ([80.91.224.195]) by mx1.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.76) (envelope-from ) id 1S7pLq-0000MT-BE for ding@lists.math.uh.edu; Wed, 14 Mar 2012 09:38:56 -0500 Original-Received: from cm-84.215.51.58.getinternet.no ([84.215.51.58] helo=stories.gnus.org) by hermes.netfonds.no with esmtpsa (TLS1.0:DHE_RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1S7pLb-0006UT-N9; Wed, 14 Mar 2012 15:38:39 +0100 Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAAHlBMVEVbAhHDoLyKBh6FZoyI Bx6DBRw+LmBtVX1JQHRmEioMqZMvAAACZ0lEQVQ4jWWTwW6bQBCGR1Vx6lvXqyo9NrvQxjfDVIbc KuSo6i1C5h4lypYcEyLDtXUFe7Qit0retjMLTkwyEkLMN//M7OwARWNMYYt4brCal1Up5jfiHMgK 9q/reFVUK4CHUgiJB208A2OssRhfxjbhuHEoDisYjc7B2NaaNZZoVwzgIAyxWbchKX43pgiFrOcO gCmKtRACzH06vb/X2oHx4wbAUhiB5EIt/iiFoTyBcarlhiRCoIT4Qt3dqliHcgZvVSi+joxZi7oB MUn+lYhCTKaQ6lBEI9vYpm1ByMOCGhHhxH9cLAhcWjpy+BGSuqrwRiLidOtv3iAmPAo5h+pSxJQK MZpuEdOv3xNrSLECQb0hlk5xJPXsHWcyrQF5Q9NJ+EjTrRY6frCOGEgqIWX9i8DJVkuNyRNokeZB sslioSMHHAKJJQEqFG2XChXW5HcKKs2jkajGS6U02s56gBir4GyrlaqqXSqKFpEiC/KxRnrXHeFz TNjvB3mekUR96RW2FXdLJjlZqjoJg8KiOuXvUwL8VseuLwKlCjJXI892gBW0DL7KulzO3xchcBuk XagaguJWqVeArtYUVGEAVK/4GeQqH4KKuzLX1O0Q1LYD3ktFD668PDt7CWi65iLP/G/pPjjuil/R gc8GbR3XtqF2/w5Bzgq+QcMjgmeQOYVtHPBh0Fav8PI8AG/nDOj5zPsD13xB4GV74IfbknOPASz3 wHQf7IjPK8l/Ij0uV1/eZ/Ch+0uBJbv7VmoPOLQ8fQbvnwBly1gRvQKs4fAoOiIwg33zfJcspA3/ NACcbpGqMNJiw5//ATglC/SKqZGCAAAAAElFTkSuQmCC X-Now-Playing: Phranc's _I Enjoy Being A Girl_ In-Reply-To: (Jan Tatarik's message of "Tue, 13 Mar 2012 11:39:27 +0100") User-Agent: Gnus/5.130004 (Ma Gnus v0.4) Emacs/24.0.94 (gnu/linux) X-MailScanner-ID: 1S7pLb-0006UT-N9 MailScanner-NULL-Check: 1332340719.79454@uZmZRc8Xq36TLdqee6TxwQ X-Spam-Status: No X-Spam-Score: -1.9 (-) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:81583 Archived-At: Jan Tatarik writes: > I finally realized the content of the messages is base64 encoded, so > matching on the raw body cannot work. > > The attached patch fixes the problem for me, but I have no idea > whether it's a generally acceptable solution. I'm only using the body > match in a low-traffic group, so speed is not an issue for me. [...] > + (when (string= (gnus-fetch-field "content-transfer-encoding") "base64") > + (article-de-base64-unreadable t)) This isn't a general enough solution here. QP-encoded messages also want decoding. But the more general issue is -- should scoring on bodies be done on the decoded messages or the encoded messages? I think it would make more sense to do it on decoded messages, and since these are body matches, speed don't really matter that much, because body matches are s-l-o-w anyway. Scoring on headers are done on the decoded headers, right? So it would be more consistent to decode the bodies, too. Wouldn't it? -- (domestic pets only, the antidote for overdose, milk.) bloggy blog http://lars.ingebrigtsen.no/