From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.user/9591 Path: news.gmane.org!not-for-mail From: Ted Zlatanov Newsgroups: gmane.emacs.gnus.user Subject: Re: Encoding of messages Date: Mon, 27 Aug 2007 10:16:35 -0500 Organization: =?utf-8?B?0KLQtdC+0LTQvtGAINCX0LvQsNGC0LDQvdC+0LI=?= @ Cienfuegos Message-ID: References: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: sea.gmane.org 1188229233 25257 80.91.229.12 (27 Aug 2007 15:40:33 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 27 Aug 2007 15:40:33 +0000 (UTC) Cc: Ding Mailing List To: info-gnus-english@gnu.org Original-X-From: info-gnus-english-bounces+gegu-info-gnus-english=m.gmane.org@gnu.org Mon Aug 27 17:40:30 2007 Return-path: Envelope-to: gegu-info-gnus-english@m.gmane.org Original-Received: from lists.gnu.org ([199.232.76.165]) by lo.gmane.org with esmtp (Exim 4.50) id 1IPghT-0004Fn-9a for gegu-info-gnus-english@m.gmane.org; Mon, 27 Aug 2007 17:40:23 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1IPghS-0006g5-L1 for gegu-info-gnus-english@m.gmane.org; Mon, 27 Aug 2007 11:40:22 -0400 Original-Path: shelby.stanford.edu!newsfeed.stanford.edu!news.tele.dk!news.tele.dk!small.news.tele.dk!multikabel.net!feed20.multikabel.net!newsfeed.freenet.de!news.albasani.net!not-for-mail Original-Newsgroups: gnu.emacs.gnus Original-Lines: 46 Original-X-Trace: news.albasani.net VX84p6mcl8fgaoYncYwFZ2BOzcUVMVYlkiiruSrmYxzK7/vs5ymBp1v/f9OEzXNNU7eTmEVpCyV/rfm4kzmBq+73FsW0VAYlVKNDkA0O4lNmN16mF2MSYy/R+XB5n6Oj Original-X-Complaints-To: abuse@albasani.net Original-NNTP-Posting-Date: Mon, 27 Aug 2007 15:16:39 +0000 (UTC) X-User-ID: hTE9LrJpKT6kolqepS59FkW21e3HbXrVEtdh658Rt6o= X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6; d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" Cancel-Lock: sha1:RMUUZizyrh7nIJ0G6Rs9WgPIMGg= sha1:hC38oPi2F18dWbeUXcF0sEWGUSc= User-Agent: Gnus/5.110007 (No Gnus v0.7) Emacs/22.1.50 (darwin) X-NNTP-Posting-Host: gS96i02Yr2w7fwBu+KYlqU5EMFxOpLgo9SCj1kWOt28= Original-Xref: shelby.stanford.edu gnu.emacs.gnus:79783 X-BeenThere: info-gnus-english@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Announcements and discussions for GNUS, the GNU Emacs Usenet newsreader \(in English\)" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: info-gnus-english-bounces+gegu-info-gnus-english=m.gmane.org@gnu.org Errors-To: info-gnus-english-bounces+gegu-info-gnus-english=m.gmane.org@gnu.org Xref: news.gmane.org gmane.emacs.gnus.user:9591 Archived-At: On Wed, 22 Aug 2007 16:28:38 +0200 eric.boese-wolf@t-online.de (Eric Böse-Wolf) wrote: EB> Ted Zlatanov writes: >> Gnus does not convert messages for storage AFAIK. What are you trying >> to accomplish? EB> I use spam.el with spam-use-bogofilter (bogofilter configured with EB> unicode=yes and default_charset=utf-8) and I noticed correctly EB> displayed german umlauts and badly displayed german umlauts in my EB> bogoutil -d wordlist.db output. So I assumed bogofilter received EB> messages in different encodings and wanted to stop that to get EB> a nice and plain bogoutil -d wordlist.db output. (I maintain spam.el) The function that gets the article as a string is (defun spam-get-article-as-string (article) (when (numberp article) (with-temp-buffer (gnus-request-article-this-buffer article gnus-newsgroup-name) (buffer-string)))) and then that string gets passed to bogofilter: (when (stringp article-string) (with-temp-buffer (insert article-string) (apply 'call-process-region (point-min) (point-max) spam-bogofilter-program nil nil nil switch (if db `("-d" ,db "-v") `("-v"))))))) So spam.el just passes the article as a string, using the gnus-request-article-this-buffer function. We can change this if necessary, to add extra parameters to bogofilter's invocation or to spam-get-article-as-string to encode to UCS/UTF-8. I don't know much about Gnus article encodings so I'm cc-ing this to the ding mailing list as well in case someone can help. Ted p.s. your English is fine :)