Announcements and discussions for Gnus, the GNU Emacs Usenet newsreader
 help / color / mirror / Atom feed
From: Garjola Dindi <garjola@garjola.net>
To: info-gnus-english@gnu.org
Subject: Incorrect character encoding in received messages
Date: Wed, 25 Dec 2019 19:13:21 +0100	[thread overview]
Message-ID: <87pngc6zpa.fsf@pc-117-162.ovh.com> (raw)

Hi all,

I have recently been having trouble with Gnus decoding some e-mails as
ASCII when actually they should be decoded as unicode.

For instance, in French, the “à” char gets displayed as “\340”.

If I go to «edit mode» with 'gnus-summary-edit-article' and just do C-c
C-c (with no real edit), the message gets displayed correctly.

Another example with the "é" char which appears as 'i' in an HTML
message. Describe char gives me this:
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >
>             character: i (displayed as i) (codepoint 105, #o151, #x69)                    >
>               charset: ascii (ASCII (ISO646 IRV))                                         >
> code point in charset: 0x69                                                               >
>                script: latin                                                              >
>                syntax: w 	which means: word                                           >
>              category: .:Base, L:Left-to-right (strong), a:ASCII, l:Latin, r:Roman        >
>              to input: type "C-x 8 RET 69" or "C-x 8 RET LATIN SMALL LETTER I"            >
>           buffer code: #x69                                                               >
>             file code: #x69 (encoded by coding system utf-8-unix)                         >
>               display: by this font (glyph code)                                          >
>     xfthb:-PfEd-DejaVu Sans-normal-normal-normal-*-16-*-*-*-*-0-iso10646-1 (#x4C)         >
>                                                                                           >
> Character code properties: customize what to show                                         >
>   name: LATIN SMALL LETTER I                                                              >
>   general-category: Ll (Letter, Lowercase)                                                >
>   decomposition: (105) ('i')                                                              >
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >

And after 'gnus-summary-edit-article' followed by C-c C-c:

>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >
>             character: é (displayed as é) (codepoint 233, #o351, #xe9)                                 >
>               charset: unicode (Unicode (ISO10646))                                                    >
> code point in charset: 0xE9                                                                            >
>                script: latin                                                                           >
>                syntax: w 	which means: word                                                        >
>              category: .:Base, L:Left-to-right (strong), c:Chinese, j:Japanese, l:Latin, v:Viet        >
>              to input: type "C-x 8 RET e9" or "C-x 8 RET LATIN SMALL LETTER E WITH ACUTE"              >
>           buffer code: #xC3 #xA9                                                                       >
>             file code: #xC3 #xA9 (encoded by coding system utf-8-unix)                                 >
>               display: by this font (glyph code)                                                       >
>     xfthb:-PfEd-DejaVu Sans-normal-normal-normal-*-16-*-*-*-*-0-iso10646-1 (#xAB)                      >
>                                                                                                        >
> Character code properties: customize what to show                                                      >
>   name: LATIN SMALL LETTER E WITH ACUTE                                                                >
>   old-name: LATIN SMALL LETTER E ACUTE                                                                 >
>   general-category: Ll (Letter, Lowercase)                                                             >
>   decomposition: (101 769) ('e' '́')                                                                    >
> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >

Any idea of what may be happening here?

I am on emacs master, but there is no difference with 26.3.

Thanks!

G.

-- 


_______________________________________________
info-gnus-english mailing list
info-gnus-english@gnu.org
https://lists.gnu.org/mailman/listinfo/info-gnus-english

             reply	other threads:[~2019-12-25 18:13 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-12-25 18:13 Garjola Dindi [this message]
2019-12-27 18:07 ` Lars Ingebrigtsen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87pngc6zpa.fsf@pc-117-162.ovh.com \
    --to=garjola@garjola.net \
    --cc=info-gnus-english@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).