From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/67830 Path: news.gmane.org!not-for-mail From: Stefan Monnier Newsgroups: gmane.emacs.gnus.general Subject: Re: bug#1174: 23.0.60; Some UTF-8 mails displaying wrongly in Emacs 23 Date: Sat, 29 Nov 2008 16:30:49 -0500 Message-ID: References: <871vyf6q40.fsf@marauder.physik.uni-ulm.de> <87k5ampwej.fsf@marauder.physik.uni-ulm.de> <878wr2ohde.fsf@mocca.josefsson.org> <87d4geim7p.fsf@marauder.physik.uni-ulm.de> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1227994344 3808 80.91.229.12 (29 Nov 2008 21:32:24 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Sat, 29 Nov 2008 21:32:24 +0000 (UTC) Cc: Simon Josefsson , Frank Schmitt , James Cloos , ding@gnus.org, 1174@emacsbugs.donarmstrong.com, Clemens Schueller To: Reiner Steib Original-X-From: ding-owner+M16277@lists.math.uh.edu Sat Nov 29 22:33:29 2008 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by lo.gmane.org with esmtp (Exim 4.50) id 1L6XRP-0006qk-Sx for ding-account@gmane.org; Sat, 29 Nov 2008 22:33:28 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1L6XPT-0008W4-9l; Sat, 29 Nov 2008 15:31:27 -0600 Original-Received: from mx1.math.uh.edu ([129.7.128.32]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1L6XPR-0008Vr-RW for ding@lists.math.uh.edu; Sat, 29 Nov 2008 15:31:25 -0600 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx1.math.uh.edu with esmtp (Exim 4.69) (envelope-from ) id 1L6XPO-0008G9-LV for ding@lists.math.uh.edu; Sat, 29 Nov 2008 15:31:25 -0600 Original-Received: from ironport2-out.pppoe.ca ([206.248.154.182] helo=ironport2-out.teksavvy.com) by quimby.gnus.org with esmtp (Exim 3.36 #1 (Debian)) id 1L6XPZ-0007TS-00 for ; Sat, 29 Nov 2008 22:31:33 +0100 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ApsEAJ5DMUnO+Jkl/2dsb2JhbACBbcwHgn2BJA X-IronPort-AV: E=Sophos;i="4.33,687,1220241600"; d="scan'208";a="30437319" Original-Received: from 206-248-153-37.dsl.teksavvy.com (HELO pastel.home) ([206.248.153.37]) by ironport2-out.teksavvy.com with ESMTP; 29 Nov 2008 16:30:49 -0500 Original-Received: by pastel.home (Postfix, from userid 20848) id BF5A284C0; Sat, 29 Nov 2008 16:30:49 -0500 (EST) In-Reply-To: <87d4geim7p.fsf@marauder.physik.uni-ulm.de> (Reiner Steib's message of "Sat, 29 Nov 2008 16:30:18 +0100") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.0.60 (gnu/linux) X-Spam-Score: -2.6 (--) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:67830 Archived-At: > I never saw this problem myself. I cannot see any difference with a > few UTF-8 articles (C-T-E: 8bit, [1]), neither with Emacs 22 (with > current Gnus trunk) or Emacs trunk (Gnus 5.13 from there). Stefan, > what are the "accidental emacs-mule-valid byte sequences" that trigger > this problem? It would be good if someone could send me a problematic > article. [2] In Emacs-22, the problem was more difficult to trigger: you had to receive an email whose undecoded text contained emacs-mule escape sequences, which is rather uncommon. With Emacs-23, it's a lot more common since the internal encoding has changed to a variant of utf-8: an 8bit body using utf-8 will see its content unwillingly decoded during nnimap-demule which leads to the bugs we've seen recently. I'm pretty sure that string-as-multibyte is wrong here in general. Maybe the problem is that nnimap-demule is used blindly in different contexts where some need string-to-multibyte and some need string-as-multibyte. E.g. maybe Simon's problem was linked to imap groups with non-ASCII chars in their names, rather than in the message bodies. Stefan