From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/53094 Path: main.gmane.org!not-for-mail From: Jan Rychter Newsgroups: gmane.emacs.gnus.general Subject: Re: something decodes my headers... Date: Tue, 10 Jun 2003 17:20:32 -0700 Sender: ding-owner@lists.math.uh.edu Message-ID: References: NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1055295875 12935 80.91.224.249 (11 Jun 2003 01:44:35 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Wed, 11 Jun 2003 01:44:35 +0000 (UTC) Original-X-From: ding-owner+M1638@lists.math.uh.edu Wed Jun 11 03:44:31 2003 Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 19Puf9-0003Li-00 for ; Wed, 11 Jun 2003 03:44:31 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 19PuhQ-0000ne-00; Tue, 10 Jun 2003 20:46:52 -0500 Original-Received: from sclp3.sclp.com ([64.157.176.121]) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 19PuhI-0000nW-00 for ding@lists.math.uh.edu; Tue, 10 Jun 2003 20:46:44 -0500 Original-Received: (qmail 6726 invoked by alias); 11 Jun 2003 01:46:43 -0000 Original-Received: (qmail 6721 invoked from network); 11 Jun 2003 01:46:43 -0000 Original-Received: from screech.rychter.com (212.87.11.114) by sclp3.sclp.com with SMTP; 11 Jun 2003 01:46:43 -0000 Original-Received: from tnuctip.rychter.com (unknown [10.197.0.2]) by screech.rychter.com (Postfix) with ESMTP id 6B3534A609 for ; Wed, 11 Jun 2003 03:46:21 +0200 (CEST) Original-Received: from tnuctip.rychter.com (localhost.localdomain [127.0.0.1]) by tnuctip.rychter.com (8.12.8/8.12.8) with ESMTP id h5B1kS8g031264 for ; Tue, 10 Jun 2003 18:46:28 -0700 Original-Received: (from jwr@localhost) by tnuctip.rychter.com (8.12.8/8.12.8/Submit) id h5B1kL99031262; Tue, 10 Jun 2003 18:46:21 -0700 Original-To: ding@gnus.org X-Spammers-Please: blackholeme@rychter.com In-Reply-To: (Simon Josefsson's message of "Thu, 05 Jun 2003 23:53:31 +0200") User-Agent: Gnus/5.1002 (Gnus v5.10.2) XEmacs/21.4 (Portable Code, linux) Precedence: bulk Xref: main.gmane.org gmane.emacs.gnus.general:53094 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:53094 >>>>> "Simon" == Simon Josefsson writes: Simon> Jan Rychter writes: >> ... and I don't know what it is. >> >> I get lots of mail with RFC2047-encoded From: fields. Mostly >> ISO-8859-2. Now, when mail-extr gets to those headers, they are >> already in 8-bit form, which causes mail-extr to fail tragically. I >> get chopped names, annoying BBDB behavior, and all sorts of >> breakage. People who get ISO-8859-1 E-mail don't see this, because >> mail-extr.el is "extended" to handle ISO-8859-1 in a particularly >> ugly way. Simon> Do you have a backtrace that says where mail-extr is invoked? Silly me. I should have prepared it right away instead of wasting your time. I don't know why I didn't think of it. Sorry about that. Simon> I think it might be a bug to invoke mail-extr on 8bit data. It is. It is really not designed to do that, and it only works for a subset of ISO-8859-1. Actually, I have the distinct impression that if we removed the 8859-1 support from mail-extr.el, the breakage would be noticed by many more people. >> I've tried to trace what it is that decodes the headers to 8bit >> before mail-extr sees them -- but I have no idea. They are >> RFC2047-encoded on the disk. Looking through my init files also did >> not produce any clues. Simon> Note that if it is bbdb that calls mail-extr, it is probably a Simon> bug in bbdb. Chopped names could indicate a bug in rfc2047.el Simon> though, there where bugs like this recently (perhaps not fixed? Simon> I recall emacs bugs caused them). Well, interestingly enough, mail-extract-address-components is being called TWICE whenever I enter a message. Once by Gnus, once by BBDB. In both cases, an 8-bit string is being passed. Backtraces below, with the name of the person in the From: field changed to my spamtrap: mail-extract-address-components("full 8bit name ") mm-dissect-buffer(nil nil) gnus-display-mime() gnus-article-prepare-display() gnus-article-prepare(96804 nil) gnus-summary-display-article(96804) gnus-summary-next-article(nil nil t) gnus-summary-prev-article(nil) call-interactively(gnus-summary-prev-article) mail-extract-address-components("full 8bit name ") bbdb-rfc822-addresses("full 8bit name ") bbdb-get-addresses(t "Jan Rychter" mail-fetch-field) bbdb/gnus-update-records(nil) bbdb/gnus-pop-up-bbdb-buffer() run-hooks(bbdb/gnus-pop-up-bbdb-buffer) apply(run-hooks bbdb/gnus-pop-up-bbdb-buffer) gnus-run-hooks(gnus-article-prepare-hook) gnus-article-prepare-display() gnus-article-prepare(96804 nil) gnus-summary-display-article(96804) gnus-summary-next-article(nil nil t) gnus-summary-prev-article(nil) call-interactively(gnus-summary-prev-article) As a reminder, this results in mail-extract-address-components returning a "chopped" partial name, because of an 8-bit character in the middle. Does this help to narrow the problem? thanks, --J.