From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.user/5431 Path: news.gmane.org!not-for-mail From: David Kastrup Newsgroups: gmane.emacs.gnus.user Subject: Bad encoding... Date: Fri, 22 Jul 2005 23:10:23 +0200 Organization: Organization?!? Message-ID: <853bq6bl8g.fsf@lola.goethe.zz> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1138671151 27379 80.91.229.2 (31 Jan 2006 01:32:31 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 31 Jan 2006 01:32:31 +0000 (UTC) Original-X-From: nobody Tue Jan 17 17:35:16 2006 Original-Path: quimby.gnus.org!news.ccs.neu.edu!news.dfci.harvard.edu!news.cis.ohio-state.edu!usenet01.sei.cmu.edu!newsfeed.news.ucla.edu!newsfeed.stanford.edu!headwall.stanford.edu!newsfeed.news2me.com!newsfeed.icl.net!newsfeed.fjserv.net!newsfeed.freenet.de!newsfeed.arcor.de!news.arcor.de!not-for-mail Original-Newsgroups: gnu.emacs.gnus X-Face: 2FEFf>]>q>2iw=B6,xrUubRI>pR&Ml9=ao@P@i)L:\urd*t9M~y1^:+Y]'C0~{mAl`oQuAl \!3KEIp?*w`|bL5qr,H)LFO6Q=qx~iH4DN;i";/yuIsqbLLCh/!U#X[S~(5eZ41to5f%E@'ELIi$t^ Vc\LWP@J5p^rst0+('>Er0=^1{]M9!p?&:\z]|;&=NP3AhB!B_bi^]Pfkw User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (gnu/linux) Cancel-Lock: sha1:O/FtHjTqChsYQVwD6qhmaMyTuyA= Original-NNTP-Posting-Date: 22 Jul 2005 23:10:24 MEST Original-NNTP-Posting-Host: fa4d962e.newsread4.arcor-online.net Original-X-Trace: DXC=5]oDZYPB2g8;iAbK4FA[l4:ejgIfPPld4jW\KbG]kaM8AV6U:Z=fE=?b>63Vj@R3T01_LiI6ENVa=3>5MOK` Take for example Message-ID: from de.talk.romance Here are some seminal points: Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_0070_01C58F0E.65ADA4C0" This is a multi-part message in MIME format. ------=_NextPart_000_0070_01C58F0E.65ADA4C0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Immer wieder trifft man Menschen, die gezielt handeln. In vielen = F=E4llen ist es meistens die Tendenz zum Negativen. Was hat es auf sich? = Now what happens with this on my System? Coding system for saving this buffer: = -- emacs-mule-unix Default coding system (for new files): u -- mule-utf-8 (alias: utf-8) Coding system for keyboard input: nil Coding system for terminal output: u -- utf-8 (alias of mule-utf-8) Defaults for subprocess I/O: decoding: u -- mule-utf-8 (alias: utf-8) encoding: u -- mule-utf-8 (alias: utf-8) Priority order for recognizing coding systems when reading files: 1. mule-utf-8 (alias: utf-8) 2. iso-latin-1 (alias: iso-8859-1 latin-1) [...] My locale is LANG=en_US.UTF-8 LC_CTYPE="en_US.UTF-8" LC_NUMERIC="en_US.UTF-8" LC_TIME="en_US.UTF-8" LC_COLLATE="en_US.UTF-8" LC_MONETARY="en_US.UTF-8" LC_MESSAGES="en_US.UTF-8" LC_PAPER="en_US.UTF-8" LC_NAME="en_US.UTF-8" LC_ADDRESS="en_US.UTF-8" LC_TELEPHONE="en_US.UTF-8" LC_MEASUREMENT="en_US.UTF-8" LC_IDENTIFICATION="en_US.UTF-8" LC_ALL= The above now seemingly assembles the above quoted-printable sequences, generates the right latin-1 characters from them, converts them into utf-8, interprets the resulting bytes as latin-1 and converts this reinterpretation then into Emacs MULE, showing a buffer that looks like utf-8 ending up by accident in a latin-1 buffer (the buffer encoding indeed is latin-1). And indeed, recode-region from "buffer was originally encoded as utf-8" to "latin-1" yields the correct result. I get this kind of nonsense not too rarely, most often with multi-part messages that specify an encoding per-part. I think this has been happening in Emacs CVS for at least half a year or so, but it is also possible that the behavior just was not noticed before by me. -- David Kastrup, Kriemhildstr. 15, 44793 Bochum