From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/16926 Path: main.gmane.org!not-for-mail From: =?ISO-8859-1?Q?Fran=E7ois_Pinard?= Newsgroups: gmane.emacs.gnus.general Subject: Re: "Coding system"? Eh? Date: 11 Sep 1998 10:27:03 +-400 Sender: owner-ding@hpc.uh.edu Message-ID: References: NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035155718 30690 80.91.224.250 (20 Oct 2002 23:15:18 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Sun, 20 Oct 2002 23:15:18 +0000 (UTC) Cc: davidk@lysator.liu.se (David Kågedal), ding@gnus.org Return-Path: Original-Received: from gizmo.hpc.uh.edu (gizmo.hpc.uh.edu [129.7.102.31]) by sclp3.sclp.com (8.8.5/8.8.5) with ESMTP id KAA14038 for ; Fri, 11 Sep 1998 10:53:07 -0400 (EDT) Original-Received: from sina.hpc.uh.edu (sina.hpc.uh.edu [129.7.3.5]) by gizmo.hpc.uh.edu (8.7.6/8.7.3) with ESMTP id JAF11262; Fri, 11 Sep 1998 09:24:04 -0500 Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Fri, 11 Sep 1998 09:51:29 -0500 (CDT) Original-Received: from sclp3.sclp.com (root@sclp3.sclp.com [209.195.19.139]) by sina.hpc.uh.edu (8.7.3/8.7.3) with ESMTP id JAA04569 for ; Fri, 11 Sep 1998 09:51:12 -0500 (CDT) Original-Received: from pluton.rtsq.qc.ca (pluton.grics.qc.ca [199.84.132.10]) by sclp3.sclp.com (8.8.5/8.8.5) with ESMTP id KAA13978 for ; Fri, 11 Sep 1998 10:50:55 -0400 (EDT) Original-Received: by pluton.rtsq.qc.ca (8.8.8/8.8.8) with UUCP id KAA04033; Fri, 11 Sep 1998 10:27:34 -0400 Original-Received: by icule.progiciels-bpi.ca (8.8.5/8.7.3) id KAA01668; Fri, 11 Sep 1998 10:27:03 -0400 Original-To: Gisle Aas X-Face: "b_m|CE6#'Q8fliQrwHl9K,]PA_o'*S~Dva{~b1n*)K*A(BIwQW.:LY?t4~xhYka_.LV?Qq `}X|71X0ea&H]9Dsk!`kxBXlG;q$mLfv_vtaHK_rHFKu]4'<*LWCyUe@ZcI6"*wB5M@[m écrit: > Unicode is in sync with ISO 10646. Also Unicode allocates characters > above U+FFFF. http://www.unicode.org/unicode/alloc/Pipeline.html. Thanks for the reference, I'll later take a look when I'll be on the net. > > The difference between UTF-16 and UCS-2 is that it can encode some of > > the charaters outside the Unicode range (BMP). So I guess Unicode has > > no need for UTF-16. > Unicode 2.x is in a way UTF-16. Yes, I got that feeling, even if I did not buy the books (it becomes expensive after a while, when you use your own money for it :-). There is some sadness in all this. The original idea was to have the capability of a set of fixed width characters covering all spoken languages. Look were we are now. UTF-16 is a variable width code, we have a lot of combining characters, and various marks for byte order, for directionality, and so forth. Many characters are missing (to the point this creates me problems within `recode'), and a non-negligible fraction of Japanese users are highly irritated by Han unification, and other things. Many people still think Unicode / ISO 10646 is another step of mankind towards God, but if look a bit inside, you'll see that reality has run and caught back progress, pretty fast, sadly enough. -- François Pinard mailto:pinard@iro.umontreal.ca Join the free Translation Project! http://www.iro.umontreal.ca/~pinard