From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/21064 Path: main.gmane.org!not-for-mail From: =?ISO-8859-1?Q?Fran=E7ois_Pinard?= Newsgroups: gmane.emacs.gnus.general Subject: Re: More charset things Date: 08 Feb 1999 18:19:22 -0500 Sender: owner-ding@hpc.uh.edu Message-ID: References: <87d83qkyjf.fsf@pc-hrvoje.srce.hr> <87ognahyoh.fsf@pc-hrvoje.srce.hr> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035159241 21039 80.91.224.250 (21 Oct 2002 00:14:01 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 21 Oct 2002 00:14:01 +0000 (UTC) Return-Path: Original-Received: from spinoza.math.uh.edu (spinoza.math.uh.edu [129.7.128.18]) by sclp3.sclp.com (8.8.5/8.8.5) with ESMTP id SAA00441 for ; Mon, 8 Feb 1999 18:20:26 -0500 (EST) Original-Received: from sina.hpc.uh.edu (lists@Sina.HPC.UH.EDU [129.7.3.5]) by spinoza.math.uh.edu (8.9.1/8.9.1) with ESMTP id RAB07733; Mon, 8 Feb 1999 17:19:37 -0600 (CST) Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Mon, 08 Feb 1999 17:20:06 -0600 (CST) Original-Received: from sclp3.sclp.com (root@sclp3.sclp.com [204.252.123.139]) by sina.hpc.uh.edu (8.7.3/8.7.3) with ESMTP id RAA27956 for ; Mon, 8 Feb 1999 17:19:56 -0600 (CST) Original-Received: from degusse.IRO.UMontreal.CA (degusse.IRO.UMontreal.CA [132.204.24.51]) by sclp3.sclp.com (8.8.5/8.8.5) with ESMTP id SAA00432 for ; Mon, 8 Feb 1999 18:19:49 -0500 (EST) Original-Received: from raptor.IRO.UMontreal.CA (raptor.IRO.UMontreal.CA [132.204.26.133]) by degusse.IRO.UMontreal.CA (8.9.1/8.9.1) with ESMTP id SAA17638 for ; Mon, 8 Feb 1999 18:19:22 -0500 (EST) Original-Received: (from pinard@localhost) by raptor.IRO.UMontreal.CA (8.8.8/8.8.8) id SAA10379; Mon, 8 Feb 1999 18:19:22 -0500 (EST) Original-To: "(ding)" X-Face: "b_m|CE6#'Q8fliQrwHl9K,]PA_o'*S~Dva{~b1n*)K*A(BIwQW.:LY?t4~xhYka_.LV?Qq `}X|71X0ea&H]9Dsk!`kxBXlG;q$mLfv_vtaHK_rHFKu]4'<*LWCyUe@ZcI6"*wB5M@[m writes: > > UTF-8 is an encoding scheme, comparable to uuencode. > It is? Then I'm confused... for some reason I was thinking that UTF-8 > *was* Unicode. Nowadays, the UCS may be represented as UCS-2 or UCS-4 internally, yet UCS-2 is often seen externally. The latest Unicode, if I understand things correctly, highly promotes what was once called UTF-16, which is a way of using one or two UCS-2 super-bytes for representing one million characters. There is also UTF-8 which is popular (and nice) and UTF-7 which is getting popular (and ugly). Nicety and ugliness is well hidden in decoders/encoders, so it does not really matter in practice. UTF-7 is a MIME related invention, it does not come from Unicode nor ISO. There also are other encodings, but they are obsolent enough to not be worth mentioning. -- François Pinard mailto:pinard@iro.umontreal.ca Join the free Translation Project! http://www.iro.umontreal.ca/~pinard