Gnus development mailing list
 help / color / mirror / Atom feed
From: Dave Love <d.love@dl.ac.uk>
Subject: Re: \201 irritation! :-)
Date: Mon, 25 Sep 2000 12:55:16 +0100	[thread overview]
Message-ID: <200009251155.MAA15586@djlvig.dl.ac.uk> (raw)
In-Reply-To: <87og1f8imp.fsf@deneb.enyo.de>

>>>>> "FW" == Florian Weimer <fw@deneb.enyo.de> writes:

 FW> If there is a byte-combination, "(char-after)" returns values
 FW> outside the usual 0 .. 255 range, and
 FW> "quoted-printable-encode-region" doesn't handle this.

Then it's either intrinsically broken or not being used appropriately
since it doesn't make sense for multibyte characters.  I suppose I'll
have to check it if no-one else can.

 FW> The best thing probably is to switch the buffer to
 FW> uni-byte mode during quoted-printable encoding.

Whatever you do, you need a buffer with encoded contents.  Making it
unibyte isn't right otherwise or you're dealing with the emacs-mule
charset.

 >> I don't know what that means.  Just spurious combination of leading
 >> bytes stuffed raw into a multibyte buffer or something else?

 FW> Take the "Chinese" line from the HELLO file, copy it to a
 FW> multi-byte buffer, do "encode-coding-region" on it and specify
 FW> "utf-8" as encoding.  Copy the result to a unibyte buffer, and
 FW> paste it back into the first (multi-byte) buffer.  

This seems to be just confusing several issues.  I don't see how it's
related to spurious byte combination, unless it refers to some problem
in Mule-UCS itself.  Can you give a code snippet demonstrating the
problems you're concerned about without using utf-8 as an example?  It
shouldn't be relevant.  Be careful to include the relevant context
(C-h C, at least) since you're concerned with code conversion.

 FW> Now the \201s are there, you can see them if you switch the first
 FW> buffer to uni-byte mode.

Of course you see the internal encoding if you switch it to unibyte
mode anyway.  You may also have pasted in raw bytes.

 FW> The trouble with UTF-8 is that it tends to generate more
 FW> byte-combinations than other encodings, as it seems.

 >> Do you mean _de_code-coding-region?  `encode-coding-region' would
 >> produce raw bytes.

 FW> And these raw bytes are not properly dealt with in multi-byte buffers.

So don't do that.  That's the point.

I'm not getting any indication that there is an issue here that needs
to be addressed in Emacs in future.



  reply	other threads:[~2000-09-25 11:55 UTC|newest]

Thread overview: 67+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2000-08-27 17:21 François Pinard
2000-08-27 21:13 ` François Pinard
2000-08-28  8:44   ` Kai Großjohann
2000-08-28 19:14 ` Stainless Steel Rat
2000-08-28 19:54   ` François Pinard
2000-08-28 21:36     ` Stainless Steel Rat
2000-08-28 22:46       ` François Pinard
2000-09-05 14:29         ` Dave Love
2000-09-05 16:19           ` Stainless Steel Rat
2000-09-08 23:32             ` Dave Love
2000-09-09  2:49               ` Stainless Steel Rat
2000-09-06 14:56           ` François Pinard
2000-09-08 22:40             ` Dave Love
2000-09-16 18:14               ` Florian Weimer
2000-09-18 14:07                 ` Dave Love
2000-09-20 16:56                   ` Florian Weimer
2000-09-20 19:26                     ` Kai Großjohann
2000-11-11 16:35                       ` Florian Weimer
2000-11-15  5:38                         ` Karl Eichwalder
2000-11-27 13:40                         ` François Pinard
2000-11-27 14:35                           ` tick and backtick in unicode Alan Shutko
2000-11-27 16:56                             ` François Pinard
2000-11-27 19:47                               ` Karl Eichwalder
2000-11-27 20:57                                 ` Stainless Steel Rat
2000-11-28  6:01                                 ` François Pinard
2000-11-29  5:45                                   ` Karl Eichwalder
2000-11-30  1:57                                     ` Kai Großjohann
2000-11-30  4:19                                       ` Karl Eichwalder
2000-11-30  6:33                                         ` Graham Murray
2000-11-30 12:38                                         ` Per Abrahamsen
2000-11-30 11:23                                       ` Toby Speight
2000-09-21 19:33                     ` \201 irritation! :-) Dave Love
2000-09-23 11:14                       ` Florian Weimer
2000-09-25 11:55                         ` Dave Love [this message]
2000-09-28 12:22                           ` Florian Weimer
2000-09-29 19:18                             ` QP encoding is now unibyte (was: " Florian Weimer
2000-10-03 23:17                             ` Dave Love
2000-09-05 14:16 ` Dave Love
2000-09-06 14:49   ` François Pinard
2000-09-06 20:15     ` Kai Großjohann
2000-09-07  7:34       ` Yair Friedman (Jerusalem)
2000-09-08  9:31         ` Janne Rinta-Manty
2000-09-08 17:49           ` Simon Josefsson
2000-09-08 23:44         ` Dave Love
2000-09-12  8:51           ` Yair Friedman (Jerusalem)
2000-09-12 12:02             ` Kai Großjohann
2000-09-13 19:25             ` Dave Love
2000-09-14  7:49               ` Yair Friedman (Jerusalem)
2000-09-14 12:18                 ` ShengHuo ZHU
2000-09-14 21:50                   ` Dave Love
2000-09-14 23:06                     ` ShengHuo ZHU
2000-09-18 13:51                       ` Dave Love
2000-09-18 14:39                         ` ShengHuo ZHU
2000-09-22 14:23                           ` Dave Love
2000-09-14 21:42                 ` Dave Love
2000-09-15  3:29                 ` ShengHuo ZHU
2000-09-20  9:34                   ` Yair Friedman (Jerusalem)
2000-09-20 13:06                     ` ShengHuo ZHU
2000-09-20 14:55                       ` Yair Friedman (Jerusalem)
2000-09-20 14:16                         ` ShengHuo ZHU
2000-09-21  8:35                           ` Yair Friedman (Jerusalem)
2000-09-21 19:02                             ` Dave Love
2000-09-23  6:35                             ` Karl Eichwalder
2000-09-25 11:33                               ` Dave Love
2000-09-25 18:50                                 ` Karl Eichwalder
2000-09-27 21:56                                   ` Dave Love
2000-09-08 22:33     ` Dave Love

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200009251155.MAA15586@djlvig.dl.ac.uk \
    --to=d.love@dl.ac.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).