From mboxrd@z Thu Jan 1 00:00:00 1970
From: Lloyd Zusman
Newsgroups: gmane.emacs.gnus.general
Subject: Re: Guidance concerning something I want to do with charsets
Date: Wed, 19 Nov 2008 15:28:07 -0500
Organization: Take a hippopotamus to lunch today.
References: <87k5azh9vw.fsf@marauder.physik.uni-ulm.de>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
To: ding@gnus.org
User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/23.0.60 (darwin)
Xref: news.gmane.org gmane.emacs.gnus.general:67804

Reiner Steib writes:

> On Wed, Nov 19 2008, Lloyd Zusman wrote:
>
>> [ ... ]
>>
>>
>>     if Content-Type specifies UTF-8
>>       use UTF-8 as the charset
>>     else
>>       use ISO-8859-1 as the charset
>
> That is the normal behavior of Gnus (and any other MIME-aware MUA).
> If you have problems with that, there's either a bug in your config
> (e.g. using Emacs in unibyte mode) or in Gnus.

Well, I have indeed been using Emacs in unibyte mode. Ages ago (well,
some time in the 1990s, I think), I started using that setting. Based
on what you're saying here, I guess that's obsolete.

I send emails to people who seem to be limited to unibyte ISO-8859-1
messages and not UTF-8. Perhaps their mailers are primitive and ignore
the Content-Type header ... I'm not sure why. This is the reason for my
having set this default, originally.

I'm now trying to catch up on my knowledge of charsets, so please
forgive my ignorance about this topic. I can see from what you wrote
below that there's a way to run Emacs in multibyte mode by default but
to use ISO-8859-1 for dealing with messages.

> I can't think of a common use case for the `nil' case beside incorrect
> charset labelling of articles (e.g. declaring iso-8859-1 when it is
> utf-8).
> For this, you can use
> `gnus-summary-show-article-charset-alist' and the numerical prefix for
> `g':

Yes, I do see incorrect charset labeling (or missing labeling
altogether) in some articles and email messages.

Is there a way to do this without the numerical prefix? I.e., some sort
of hook that I can use to match the sender or newsgroup against a
pattern and then force the buffer to be encoded via unibyte ISO-8859-1
if the pattern matches?

> ,----[ (info "(gnus)Paging the Article") ]
> | `A g'
> | `g'
> | [ ... etc. ... ]
> `----
>
>> 4. In all cases, the charset selected via Items 1, 2, and 3 should be
>>    used both for decoding the message for display *and* for encoding my
>>    replies and follow-ups.
>
> Why do you think that this is useful?

Well, maybe the only reason I think this way is due to ignorance. Some
of the recipients of my email messages and readers of my newsgroup
posts (on a private, very small news server ... maybe 20 people) do not
seem to be able to read UTF-8 encodings. This is why I want to force
ISO-8859-1 when writing to these people or posting in that private
newsgroup, whether these are replies or whether I'm initiating the
message.

>> Is this a common set of tasks which are easy to perform in gnus, or am I
>> trying to do something that is as idiosyncratic as many of the other
>> tasks that I tend to want to perform?
>
> Gnus already does The Right Thing by default. You can specify which
> charsets to prefer via `mm-coding-system-priorities':

OK. I think I understand. I'll dig into this further, and I'll see if I
can fix my Emacs so as not to use unibyte by default any longer, and
then to make use of the features you have described.

All that remains is for me to figure out how to write the hook that I
mentioned above.

Thank you very much.

-- 
 Lloyd Zusman
 ljz@asfast.com
 God bless you.