From: Reiner Steib
Newsgroups: gmane.emacs.gnus.general
Subject: Re: text/plain; charset=UTF-8 doesn't work anymore
Date: Fri, 14 Jan 2005 13:04:30 +0100
Organization: Dept. of Theoretical Physics, University of Ulm
References: <87vfa0o7ag.fsf@dod.no>
Mail-Followup-To: ding@gnus.org

>>>>> In Reiner Steib wrote:
>
>> I'd strongly recommend to get rid of mule-ucs in Emacs 21.
>
> But it is essential if a user want to encode Japanese (and
> possibly Chinese, Korean, etc.) text with the released versions
> of Emacs 21 (i.e., 21.3 and earlier).

Sorry, I wasn't aware of this.  I know that CJK support in Emacs 21.3
is not very good.  As Steinar is from Norway, this is probably not
very important for him, though.  I seem to remember that Mule-UCS has
some problem with Emacs 21.

> Try the following without Mule-UCS.
>
> (let ((str (string (make-char 'japanese-jisx0208 36 34))))
>   (list str
>         (decode-coding-string
>          (encode-coding-string str 'utf-8)
>          'utf-8)))

,----
| *** Welcome to IELM ***  Type (describe-mode) for help.
| ELISP> (let ((str (string (make-char 'japanese-jisx0208 36 34))))
|          (list str
|                (decode-coding-string
|                 (encode-coding-string str 'utf-8)
|                 'utf-8)))
| ("あ" "�")
|
| ELISP> emacs-version
| "21.3.1"
| ELISP>
`----

Okay, it should have been the same character, I guess, but it isn't:

,----
|   character: あ (0151042, 53794, 0xd222)
|     charset: japanese-jisx0208 (JISX0208.1983/1990 Japanese Kanji: ISO-IR-87)
|  code point: 36 34
|      syntax: word
|    category: H:Japanese Hiragana characters of 2-byte character sets
|              j:Japanese
|              |:While filling, we can break a line at this character.
| buffer code: 0x92 0xA4 0xA2
|   file code: not encodable by coding system iso-latin-9
|        font: -Misc-Fixed-Medium-R-Normal--14-130-75-75-C-140-JISX0208.1983-0
`----

,----
|   character: � (01175275, 326333, 0x4fabd)
|     charset: mule-unicode-e000-ffff
|              (Unicode characters of the range U+E000..U+FFFF.)
|  code point: 117 61
|      syntax: word
|    category:
| buffer code: 0x9C 0xF3 0xF5 0xBD
|   file code: not encodable by coding system iso-latin-9
|        font: -Misc-Fixed-Medium-R-Normal--14-130-75-75-C-70-ISO10646-1
`----

Bye, Reiner.
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo---  |  PGP key available  |  http://rsteib.home.pages.de/
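
For reference, the round trip quoted above can be cross-checked outside Emacs. The following is only a sketch in Python, not anything Emacs does internally. It builds the character at JIS X 0208 code point 36 34 via EUC-JP, round-trips it through UTF-8, and then works out which Unicode code point the wrongly decoded character (mule-unicode-e000-ffff, code point 117 61) would correspond to. The helper name `mule_unicode_e000_to_ucs` is made up for illustration, and the 96x96 layout with an offset of 32 per dimension is an assumption about that charset; it does agree with the "�" shown in the output above.

```python
# JIS X 0208 code point "36 34" (row 4, cell 2), as passed to make-char above.
jis_bytes = bytes([36, 34])

# EUC-JP represents a JIS X 0208 character by setting the high bit of each byte.
euc_bytes = bytes(b | 0x80 for b in jis_bytes)
original = euc_bytes.decode("euc_jp")

# A correct UTF-8 round trip must give back the same character.
utf8_bytes = original.encode("utf-8")
round_tripped = utf8_bytes.decode("utf-8")


def mule_unicode_e000_to_ucs(c1: int, c2: int) -> int:
    """Hypothetical helper: map a mule-unicode-e000-ffff code point to Unicode.

    Assumes a 96x96 charset whose two position codes each start at 32 and
    whose first character is U+E000.
    """
    return 0xE000 + (c1 - 32) * 96 + (c2 - 32)


decoded_cp = mule_unicode_e000_to_ucs(117, 61)

print(f"original U+{ord(original):04X}, UTF-8 bytes {utf8_bytes.hex()}")
print(f"round trip preserved: {round_tripped == original}")
print(f"wrongly decoded character: U+{decoded_cp:04X}")
```

Under these assumptions the second element of the list is U+FFFD REPLACEMENT CHARACTER, i.e. the decoder did not recognize the UTF-8 sequence for U+3042 at all, rather than mapping it to a different Japanese character.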