From: Simon Josefsson
Newsgroups: gmane.emacs.gnus.general
Subject: Re: imap breaks latin-1 characters
Date: 19 Sep 2000 11:27:26 +0200
Original-To: ShengHuo ZHU
Cc: ding@gnus.org
References: <87vgvu81n4.fsf@gnu.org> <200009181322.e8IDMYg03611@zsh.2y.net> <200009181517.e8IFHV503937@zsh.2y.net> <200009181958.e8IJwRT06371@zsh.2y.net> <200009182234.e8IMY0R07025@zsh.2y.net>
In-Reply-To: <200009182234.e8IMY0R07025@zsh.2y.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Transfer-Encoding: 8bit
User-Agent: Gnus/5.0808 (Gnus v5.8.8) Emacs/20.7

ShengHuo ZHU writes:

> > > In the second case, string-as-multibyte did the trick, but it is not a
> > > total solution, especially for those cases with \200-\237 in the
> > > string. For example, (string-as-multibyte "1234\201\337ABCD") returns
> > > a string with one latin-iso8859-1 character instead of two 8-bit
> > > characters.
> >
> > Ok, I see the problems. Will this ever be fixed in Emacs 20.x, or is
> > the answer to wait for Emacs 21 here? Replacing unibyte buffers with
> > string-as-multibyte's is of no use if string-as-multibyte is buggy.
>
> It is probably a feature instead of a bug. But the document "contains
> an individual 8-bit byte (i.e. not part of multibyte form)" is
> confusing.

But won't this cause problems for us? If we replace unibyte buffers
with string-as-multibyte where necessary (which I agree with), won't we
mutilate mail that contains \200-\237 characters?

Since both (string-as-multibyte "1234\337ABCD") and
(string-as-multibyte "1234\201\337ABCD") look the same, I think we're
in trouble. Or is there a (string-as-multibyte-foo "1234\201\337ABCD")
that returns a multibyte string that will display as 1234\201ßABCD?
Isn't that what we need?

> > More problems: Press C-x C-e in the article buffer to evaluate your
> > examples. I get two \201's in the echo area from the first example
> > and one \201 from the second. Why?! The echo area _is_ a multibyte
> > buffer, isn't it?
>
> > I've seen \201's in the echo area before (BBDB) but never elsewhere,
> > this might be the same issue. I've no idea how to debug this.
>
> Wait a second. I see `1234\201ßABCD' from the first case and
> `1234ßABCD' from the second in both Emacs 20.7 and 21.0.90. Did you
> see EXACTLY?

Now I don't get that behaviour, but I'm sure I did last time.
Interesting.

Ok, now I get it again. All I did was do something else for a while.
I'll try to narrow it down to specific commands. I first thought it
was the backlog, but it doesn't seem to be.

In the echo area:

First case: `1234\201\201ßABCD'
Second case: `1234\201ßABCD'

A freshly started emacs/gnus displays one fewer \201 in both cases.

In the *scratch* buffer I always seem to get `1234\201ßABCD' and
`1234ßABCD' respectively.

This is Emacs 20.7.
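
One way to take the display out of the picture when comparing the two cases is to look at character counts and character codes rather than at what the echo area or *scratch* prints. The following is only a rough sketch: `my-probe-string' is a hypothetical helper name, not something in Emacs or Gnus, and no results are claimed here because they may well differ between Emacs versions (string-as-multibyte was marked obsolete in later releases).

```elisp
;; Probe only: evaluate each form with C-x C-e and compare the results.
;; `my-probe-string' is a made-up helper for this thread, not an Emacs
;; or Gnus function.
(defun my-probe-string (bytes)
  "Return a list (LENGTH CODES) for BYTES after `string-as-multibyte'.
LENGTH is the number of characters and CODES is the list of
character codes, so the result does not depend on how the
string happens to be displayed."
  (let ((s (string-as-multibyte bytes)))
    (list (length s)          ; characters, not bytes
          (append s nil))))   ; string -> list of character codes

(my-probe-string "1234\337ABCD")
(my-probe-string "1234\201\337ABCD")
```

If the two results are identical, the \201 byte has been lost in the conversion itself; if they differ, the strings are distinct and only the display is misleading.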