From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/61653 Path: news.gmane.org!not-for-mail From: David Kastrup Newsgroups: gmane.emacs.gnus.general Subject: Re: Where to complain? Date: Tue, 10 Jan 2006 10:05:44 +0100 Message-ID: <85hd8c78rb.fsf@lola.goethe.zz> References: <85u0cc7d4r.fsf@lola.goethe.zz> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1136884218 3173 80.91.229.2 (10 Jan 2006 09:10:18 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 10 Jan 2006 09:10:18 +0000 (UTC) Cc: ding@gnus.org Original-X-From: ding-owner+m10185@lists.math.uh.edu Tue Jan 10 10:10:16 2006 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by ciao.gmane.org with esmtp (Exim 4.43) id 1EwFVw-0004aA-EV for ding-account@gmane.org; Tue, 10 Jan 2006 10:10:00 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu ident=lists) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 1EwFVs-0003zk-00; Tue, 10 Jan 2006 03:09:56 -0600 Original-Received: from nas01.math.uh.edu ([129.7.128.39]) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 1EwFUi-0003zf-00 for ding@lists.math.uh.edu; Tue, 10 Jan 2006 03:08:44 -0600 Original-Received: from quimby.gnus.org ([80.91.224.244]) by nas01.math.uh.edu with esmtp (Exim 4.52) id 1EwFUg-0005hj-TI for ding@lists.math.uh.edu; Tue, 10 Jan 2006 03:08:44 -0600 Original-Received: from fencepost.gnu.org ([199.232.76.164]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1EwFUg-0005AZ-00 for ; Tue, 10 Jan 2006 10:08:42 +0100 Original-Received: from localhost ([127.0.0.1] helo=lola.goethe.zz) by fencepost.gnu.org with esmtp (Exim 4.34) id 1EwFSd-00039r-Ir; Tue, 10 Jan 2006 04:06:35 -0500 Original-Received: by lola.goethe.zz (Postfix, from userid 1002) id 712E11C4F93C; Tue, 10 Jan 2006 10:05:45 +0100 (CET) Original-To: Katsumi Yamaoka In-Reply-To: (Katsumi Yamaoka's message of "Tue, 10 Jan 2006 17:36:51 +0900") User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (gnu/linux) X-Spam-Score: -2.6 (--) Precedence: bulk Original-Sender: ding-owner@lists.math.uh.edu Xref: news.gmane.org gmane.emacs.gnus.general:61653 Archived-At: Katsumi Yamaoka writes: >>>>>> In <85u0cc7d4r.fsf@lola.goethe.zz> David Kastrup wrote: > >> when Gnus is rendering HTML mail parts in an utf-8 language setting, >> it replaces all (properly declared) latin-1 characters in the HTML >> mail part with spaces. If one uses K B to select the text >> alternative, those characters render fine. > >> So what software subsystem is involved with the HTML rendering, and >> where would one report it? > > I don't know what does it, but such a behavior is performed by > the HTML renderer specified in the `mm-text-html-renderer' > variable; `w3m', `links' and `lynx' work fine, AFAIK. w3m-standalone in my case, which would be the standard setting. So it would appear that the character encoding (which is part of the MIME part declaration) is not passed to w3m or something? w3m alone used on web pages outside of Emacs appears to work. Maybe something is wrong in mm-view.el? It would appear that the following is responsible for the stuff: (defun mm-inline-render-with-stdin (handle post-func cmd &rest args) (let ((source (mm-get-part handle))) (mm-insert-inline handle (mm-with-unibyte-buffer (insert source) (apply 'mm-inline-wash-with-stdin post-func cmd args) (buffer-string))))) Now what happens with the unibyte-buffer? Perhaps the mm-inline-wash-with-stdin process filter is not too happy about inserting w3m output in utf-8 (the current locale) into a unibyte buffer? Or buffer-string, which likely happens to be unibyte decoded into utf-8, then gets interpreted as latin-1, the encoding of the mime part? Or w3m tries interpreting the latin-1 encoded input as utf-8? There seem many ways to get this wrong. When the locale and language environment and part encoding are all latin-1, stuff works. -- David Kastrup, Kriemhildstr. 15, 44793 Bochum