From: Adam Sjøgren <asjo@koldfront.dk>
Newsgroups: gmane.emacs.gnus.general,gmane.emacs.devel
Subject: Re: Resending email in Gnus, figuring out charset
Date: Wed, 31 Oct 2018 20:43:44 +0100
Organization: koldfront - analysis & revolution, Copenhagen, Denmark
Message-ID: <87muqtlx5b.fsf@tullinup.koldfront.dk>
References: <87in1ktvau.fsf@tullinup.koldfront.dk>
	<87tvl4h2th.fsf@igel.home>
	<87d0rstpoj.fsf@tullinup.koldfront.dk>
	<83o9bckl57.fsf@gnu.org>
	<87sh0mlzjm.fsf@tullinup.koldfront.dk>
	<83ftwmhqqy.fsf@gnu.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux)
To: ding@gnus.org
Cc: emacs-devel@gnu.org

Eli writes:

>> The Content-Transfer-Encoding: 8bit header means "raw bytes in the
>> body", and the Content-Type: text/plain; charset=utf-8 explains how
>> those bytes should be interpreted, right?
>
> These headers tell the receiving end how to interpret the message.

Yes. So as I received this email, Gnus should be interpreting the bytes
as utf-8. And it seems to be, as they are displayed correctly.

> But I meant something different: what you have in the Gnus buffer
> _before_ the message is sent.

Before I resend the message, the buffer looks correct (i.e. I see the
arrow and the accented e rather than \nnn\nnn\nnn etc.)

>> When I look at the feedbase-email in Gnus, it is displayed as expected,
>> but when I try to resend it, for some reason Gnus can't guess what the
>> encoding should be.
>
> That's a sign of raw bytes in the buffer.
>
> If you go to one of the offending characters in the Gnus buffer and
> type "C-u C-x =", what does Emacs show about those characters?
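
The header semantics discussed above can be illustrated with a small
Python sketch. It is only an illustration using Python's standard email
package on a made-up message (the body text "café →" is an assumption
chosen for the example); it says nothing about how Gnus handles the
article internally.

```python
from email import message_from_bytes, policy

# Hypothetical message: the body is sent as raw 8-bit bytes, and the
# Content-Type header declares that those bytes are UTF-8.
raw = (
    b"Content-Type: text/plain; charset=utf-8\r\n"
    b"Content-Transfer-Encoding: 8bit\r\n"
    b"\r\n"
    b"caf\xc3\xa9 \xe2\x86\x92\r\n"
)

msg = message_from_bytes(raw, policy=policy.default)

# The charset parameter says how the body bytes should be interpreted.
charset = msg.get_content_charset()

# With an 8bit transfer encoding there is nothing to undo: the payload
# bytes are the body bytes as they appeared on the wire.
body_bytes = msg.get_payload(decode=True)

# Decoding the raw bytes with the declared charset yields the characters.
text = body_bytes.decode(charset)

# Going the other way, these are the UTF-8 byte sequences for the two
# characters in question.
arrow_bytes = "\u2192".encode("utf-8")   # RIGHTWARDS ARROW
e_acute_bytes = "\u00e9".encode("utf-8")  # LATIN SMALL LETTER E WITH ACUTE
```

If the same bytes were decoded with a different charset, or not decoded
at all, they would show up as several separate byte values instead of
one character each.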

Ok, if I open the feedbase-email in Gnus, before I press S D r to
resend, and move point to → and é in the *Article* buffer, I get:

             position: 530 of 684 (77%), column: 1
            character: → (displayed as →) (codepoint 8594, #o20622, #x2192)
    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x2192
               script: symbol
               syntax: .    which means: punctuation
             category: .:Base, c:Chinese, h:Korean, j:Japanese
             to input: type "C-x 8 RET 2192" or "C-x 8 RET RIGHTWARDS ARROW"
          buffer code: #xE2 #x86 #x92
            file code: #xE2 #x86 #x92 (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
    xft:-PfEd-DejaVu Sans Mono-normal-normal-normal-*-20-*-*-*-m-0-iso10646-1 (#x7AE)

Character code properties: customize what to show
  name: RIGHTWARDS ARROW
  old-name: RIGHT ARROW
  general-category: Sm (Symbol, Math)
  decomposition: (8594) ('→')

and:

             position: 284 of 684 (41%), column: 6
            character: é (displayed as é) (codepoint 233, #o351, #xe9)
    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0xE9
               script: latin
               syntax: w    which means: word
             category: .:Base, L:Left-to-right (strong), c:Chinese, j:Japanese, l:Latin, v:Viet
             to input: type "C-x 8 RET e9" or "C-x 8 RET LATIN SMALL LETTER E WITH ACUTE"
          buffer code: #xC3 #xA9
            file code: #xC3 #xA9 (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
    xft:-PfEd-DejaVu Sans Mono-normal-normal-normal-*-20-*-*-*-m-0-iso10646-1 (#xAB)

Character code properties: customize what to show
  name: LATIN SMALL LETTER E WITH ACUTE
  old-name: LATIN SMALL LETTER E ACUTE
  general-category: Ll (Letter, Lowercase)
  decomposition: (101 769) ('e' '́')


  Best regards,

    Adam

-- 
 "God must've been punting angels left and right."            Adam Sjøgren
                                                         asjo@koldfront.dk