From: Reiner Steib <4uce.02.r.steib@gmx.net>
Newsgroups: gmane.emacs.gnus.general
Subject: Re: UTF-8 breakage
Date: Mon, 02 Dec 2002 12:42:57 +0100
References: <84d6otk0a6.fsf@lucy.cs.uni-dortmund.de>
Reply-To: reiner.steib@gmx.de
Mail-Followup-To: ding@gnus.org
User-Agent: Gnus/5.090008 (Oort Gnus v0.08) Emacs/21.3.50 (i686-pc-linux-gnu)

On Mon, Dec 02 2002, Simon Josefsson wrote:

> Reiner Steib <4uce.02.r.steib@gmx.net> writes:
>> On Sun, Dec 01 2002, Simon Josefsson wrote:
>> I didn't mention (and test) Gnus 5.8.8, because I thought that it's not
>> recommended to use Emacs 21.x together with Gnus 5.8.8 (as Gnus 5.9 is
>> shipped with Emacs 21.x).
>
> Ah.  I was thinking of XEmacs users, but perhaps the page was about
> Emacs only.

It is about Emacs only.  For XEmacs, I only know that there's the
latin-unity package.  But, as I mentioned in the last post, people
report that they can't get proper unification with XEmacs 21.4 and
Gnus 5.8.8.

>>> I think Oort is supposed to unify at least across ISO-8859-*
>>> automatically when possible
[...]
räksmörgås r€ksm€rg€s

>> When I *dis*able unify-8859-on-{de,en}coding-mode and add a `ß'
>> (Latin-1) replying to your message, I get utf-8 (checked with
>> `mml-preview') instead of Latin-9.
>
> Do you instruct Emacs or Gnus to prefer UTF-8 anywhere?

No, at least not on purpose.

I tested it with a minimal setup now:

- Starting Emacs 21.1 (LC_CTYPE=en_US.ISO_8859-1)
- reloading `utf-8', enable unify-8859-on-{de,en}coding-mode [0]
- start Oort Gnus
- replying to your message and inserting Latin-1 characters
- disable unify-8859-on-{de,en}coding-mode
- C-u C-c RET P
==> utf-8

- Starting Emacs 21.1 and Oort Gnus (LC_CTYPE=en_US.ISO_8859-1)
- replying to your message and inserting Latin-1 characters
  (LC_CTYPE=en_US.ISO_8859-1)
- C-u C-c RET P
==> Latin-9

> I think the unification is to the preferred charset.

It works (without unify-8-o-*-mode) sometimes, but not always:

- Starting Emacs 21.1 and Oort Gnus (LC_CTYPE=en_US.ISO_8859-1)
- Reply to news:m3fzthfiyl.fsf@merlin.emma.line.org (Article with a
  `ß' from Latin-9).
- insert a Latin-1 Currency sign or acute accent[1] (not codable in
  Latin-9!)
- C-u C-c RET P
==> "A message part needs to be split into 2 charset parts.  Really send?"

With unify-8-o-*-mode, I get unification to Latin-1.

- Starting Emacs 21.1 and Oort Gnus (LC_CTYPE=en_US.ISO_8859-1)
- start with a new article
- insert a Latin-1 `ä'
- insert a unicode € [2]
- C-u C-c RET P
==> "... split into 2 charset parts ..."
    --> charset=iso-8859-15 (Part 1)
    --> charset=utf-8 (Part 2)

With unify-8-o-*-mode, I get unification to Latin-9 in this case.

So I think that even with Oort ucs-tables is still useful.

Bye, Reiner.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
[0] This is what the file `rs-gnus-unify.el' does (apart from some
    tests).

[1] (insert (string (make-char 'latin-iso8859-1 164))) ; CURRENCY SIGN
    (insert (string (make-char 'latin-iso8859-1 180))) ; ACUTE ACCENT
[2] (insert (string (make-char 'mule-unicode-0100-24ff 116 76))); EURO
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo--- PGP key available via WWW   http://rsteib.home.pages.de/