From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/65091 Path: news.gmane.org!not-for-mail From: Ted Zlatanov Newsgroups: gmane.emacs.gnus.general Subject: Re: Encoding of messages Date: Mon, 27 Aug 2007 10:16:35 -0500 Organization: =?utf-8?B?0KLQtdC+0LTQvtGAINCX0LvQsNGC0LDQvdC+0LI=?= @ Cienfuegos Message-ID: References: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1188227857 20102 80.91.229.12 (27 Aug 2007 15:17:37 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 27 Aug 2007 15:17:37 +0000 (UTC) Cc: Ding Mailing List Original-X-From: ding-owner+M13599@lists.math.uh.edu Mon Aug 27 17:17:36 2007 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by lo.gmane.org with esmtp (Exim 4.50) id 1IPgLP-0003DM-B3 for ding-account@gmane.org; Mon, 27 Aug 2007 17:17:35 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1IPgL9-0001Fp-BK; Mon, 27 Aug 2007 10:17:19 -0500 Original-Received: from mx2.math.uh.edu ([129.7.128.33]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1IPgL7-0001FW-7x for ding@lists.math.uh.edu; Mon, 27 Aug 2007 10:17:17 -0500 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx2.math.uh.edu with esmtp (Exim 4.67) (envelope-from ) id 1IPgL4-0004gf-8G for ding@lists.math.uh.edu; Mon, 27 Aug 2007 10:17:17 -0500 Original-Received: from mailx01.tmomail.net ([66.94.9.231]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1IPgL3-00034S-00 for ; Mon, 27 Aug 2007 17:17:13 +0200 Original-Received: from tzz.local (222.2.223.10.in-addr.arpa [10.223.2.222]) by mailx01.tmomail.net (8.12.11.20060308/8.12.11) with ESMTP id l7RFGdsr009608 for ; Mon, 27 Aug 2007 08:16:40 -0700 Original-Newsgroups: gnu.emacs.gnus X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6;d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" User-Agent: Gnus/5.110007 (No Gnus v0.7) Emacs/22.1.50 (darwin) Cancel-Lock: sha1:RMUUZizyrh7nIJ0G6Rs9WgPIMGg= Mail-Followup-To: Ding Mailing List In-Reply-To: ("Eric =?iso-8859-1?Q?B?= =?iso-8859-1?Q?=F6se-Wolf=22's?= message of "Wed, 22 Aug 2007 16:28:38 +0200") Posted-To: gnu.emacs.gnus X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 mlx=0 adultscore=0 adjust=0 reason=mlx engine=3.1.0-0708090000 definitions=main-0708270022 X-Spam-Score: -2.5 (--) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:65091 Archived-At: The following message is a courtesy copy of an article that has been posted to gnu.emacs.gnus as well. On Wed, 22 Aug 2007 16:28:38 +0200 eric.boese-wolf@t-online.de (Eric B=F6se= -Wolf) wrote:=20 EB> Ted Zlatanov writes: >> Gnus does not convert messages for storage AFAIK. What are you trying >> to accomplish? EB> I use spam.el with spam-use-bogofilter (bogofilter configured with EB> unicode=3Dyes and default_charset=3Dutf-8) and I noticed correctly EB> displayed german umlauts and badly displayed german umlauts in my EB> bogoutil -d wordlist.db output. So I assumed bogofilter received EB> messages in different encodings and wanted to stop that to get EB> a nice and plain bogoutil -d wordlist.db output. (I maintain spam.el) The function that gets the article as a string is (defun spam-get-article-as-string (article) (when (numberp article) (with-temp-buffer (gnus-request-article-this-buffer article gnus-newsgroup-name) (buffer-string)))) and then that string gets passed to bogofilter: (when (stringp article-string) (with-temp-buffer (insert article-string) (apply 'call-process-region (point-min) (point-max) spam-bogofilter-program nil nil nil switch (if db `("-d" ,db "-v") `("-v"))))))) So spam.el just passes the article as a string, using the gnus-request-article-this-buffer function. We can change this if necessary, to add extra parameters to bogofilter's invocation or to spam-get-article-as-string to encode to UCS/UTF-8. I don't know much about Gnus article encodings so I'm cc-ing this to the ding mailing list as well in case someone can help. Ted p.s. your English is fine :)