From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/53741 Path: main.gmane.org!not-for-mail From: Oliver Scholz Newsgroups: gmane.emacs.gnus.general Subject: Re: Gnus: UTF-8 and compatibility with other MUAs Date: Sat, 16 Aug 2003 21:18:51 +0200 Sender: ding-owner@lists.math.uh.edu Message-ID: References: NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: sea.gmane.org 1061062461 16177 80.91.224.253 (16 Aug 2003 19:34:21 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 16 Aug 2003 19:34:21 +0000 (UTC) Original-X-From: ding-owner+M2284@lists.math.uh.edu Sat Aug 16 21:34:20 2003 Return-path: Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 19o6od-0000kM-00 for ; Sat, 16 Aug 2003 21:34:20 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 19o6nG-00053M-00; Sat, 16 Aug 2003 14:32:55 -0500 Original-Received: from sclp3.sclp.com ([64.157.176.121]) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 19o6n8-00053E-00 for ding@lists.math.uh.edu; Sat, 16 Aug 2003 14:32:46 -0500 Original-Received: (qmail 67382 invoked by alias); 16 Aug 2003 19:32:46 -0000 Original-Received: (qmail 67377 invoked from network); 16 Aug 2003 19:32:45 -0000 Original-Received: from main.gmane.org (80.91.224.249) by sclp3.sclp.com with SMTP; 16 Aug 2003 19:32:45 -0000 Original-Received: from list by main.gmane.org with local (Exim 3.35 #1 (Debian)) id 19o6oC-0003Ei-00 for ; Sat, 16 Aug 2003 21:33:52 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-To: ding@gnus.org Original-Received: from sea.gmane.org ([80.91.224.252]) by main.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 19o6oB-0003Ea-00 for ; Sat, 16 Aug 2003 21:33:51 +0200 Original-Received: from news by sea.gmane.org with local (Exim 3.35 #1 (Debian)) id 19o6n4-0004B7-00 for ; Sat, 16 Aug 2003 21:32:42 +0200 Original-Lines: 37 Original-X-Complaints-To: usenet@sea.gmane.org X-Attribution: os X-Face: "HgH2sgK|bfH$;PiOJI6|qUCf.ve<51_Od(%ynHr?=>znn#~#oS>",F%B8&\vus),2AsPYb -n>PgddtGEn}s7kH?7kH{P_~vu?]OvVN^qD(L)>G^gDCl(U9n{:d>'DkilN!_K"eNzjrtI4Ya6;Td% IZGMbJ{lawG+'J>QXPZD&TwWU@^~A}f^zAb[Ru;CT(UA]c& User-Agent: Gnus/5.1002 (Gnus v5.10.2) Emacs/21.3.50 (windows-nt) Cancel-Lock: sha1:x0PXTXQedOW7SYdbwhOabMG1xpE= Precedence: bulk Xref: main.gmane.org gmane.emacs.gnus.general:53741 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:53741 Simon Josefsson writes: > Oliver Scholz writes: [Klingon and Tengwar in Unicode] > Wasn't the Klingon proposal for Unicode rejected? Tengwar has been a > proposal for ten years, or so, and nothing has happend, as far as I > know. I have no idea. I was just looking for exotic examples and these two were the second and third ones that came to my mind. [...] > UTF-16? It's not even a well define encoding scheme, two files may > contain the exact same Unicode code points, but may differ in a binary > comparison, due to byte ordering. That's what the byte order mark is for. > And concatenating two UTF-16 strings from different sources requires > knowledge about the encoding. And surrogate pairs complicate matters > as well. Why do you think that surrogate pairs complicate matters? There can't be any confusion whether an arbitrary 16 bit value is part a surrogate pair or not; and if it is, whether it is the higher surrogate or the lower one. As for concatenating I'd say this depends on whether the tools are able to deal with it. But I do have to admit that I have zero experience with UTF-16. I don't know how good it is in daily use. I use only UTF-8. I mentioned UTF-16 only because I am told that it is important in some areas (Java, MS Windows, XML ...). Oliver -- 29 Thermidor an 211 de la Révolution Liberté, Egalité, Fraternité!