From: Alexandre Oliva
Newsgroups: gmane.emacs.gnus.general
To: ding@gnus.org
Subject: gnus saves text/html part in wrong encoding before links -dump
Date: Tue, 02 Dec 2008 00:58:59 -0200
Organization: Free thinker, not speaking for University of Campinas
User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.2 (gnu/linux)

Hi there,

I've just had a chance to investigate why text/html parts with non-ASCII
characters were displayed incorrectly in Fedora 10's GNU Emacs.

It turns out that Gnus selects links to convert HTML to text. When it is
fed an ISO-8859-1 message encoded with, e.g., quoted-printable, it decodes
that to ISO-8859-1, saves it in a file, and then runs links -dump.
Unfortunately, elinks-0.12 (unlike 0.11) won't deal with ISO-8859-1 when
LANG is *@utf-8, not even with -dump-charset set appropriately.

It seems to me that we try to convert the text to the system charset
before running links -dump, but the conversion isn't making any changes
whatsoever: the file ends up in ISO-8859-1, and then the output we get
back from links is corrupted.

More details at https://bugzilla.redhat.com/show_bug.cgi?id=474069

-- 
Alexandre Oliva           http://www.lsd.ic.unicamp.br/~oliva/
You must be the change you wish to see in the world. -- Gandhi
Be Free! -- http://FSFLA.org/   FSF Latin America board member
Free Software Evangelist      Red Hat Brazil Compiler Engineer
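
To illustrate the conversion step the message says is not taking effect, here is a minimal sketch in Python. It is not Gnus's actual Emacs Lisp code: the function name `html_part_to_utf8` and the sample body are hypothetical, and it assumes the system charset is UTF-8. It shows the difference between merely undoing quoted-printable (which leaves ISO-8859-1 bytes, as in the report) and additionally re-encoding to the system charset before the file would be handed to an external dumper.

```python
import quopri


def html_part_to_utf8(payload: bytes, declared_charset: str = "iso-8859-1") -> bytes:
    """Decode a quoted-printable body and re-encode it as UTF-8.

    Hypothetical helper for illustration only; assumes the target
    (system) charset is UTF-8.
    """
    raw = quopri.decodestring(payload)   # undo quoted-printable; bytes are still in declared_charset
    text = raw.decode(declared_charset)  # interpret those bytes as text
    return text.encode("utf-8")          # re-encode in the assumed system charset


# Sample quoted-printable body containing "á" (0xE1 in ISO-8859-1).
qp_body = b"<p>Ol=E1, mundo</p>"

# What ends up in the file if only the transfer encoding is undone:
latin1_bytes = quopri.decodestring(qp_body)

# What would be written if the charset conversion actually happened:
utf8_bytes = html_part_to_utf8(qp_body)
```

Here `latin1_bytes` still contains the single byte 0xE1, which is not valid UTF-8 on its own, while `utf8_bytes` carries the two-byte UTF-8 sequence for the same character.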