From mboxrd@z Thu Jan 1 00:00:00 1970 Received: (from weis@localhost) by pauillac.inria.fr (8.7.6/8.7.3) id SAA02832 for caml-red; Thu, 11 Jan 2001 18:38:54 +0100 (MET) Received: from nez-perce.inria.fr (nez-perce.inria.fr [192.93.2.78]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id NAA05538 for ; Thu, 11 Jan 2001 13:56:06 +0100 (MET) Received: from mrwall.kal.com (mrwall.kal.com [194.193.14.236]) by nez-perce.inria.fr (8.11.1/8.10.0) with SMTP id f0BCu5T04060 for ; Thu, 11 Jan 2001 13:56:06 +0100 (MET) Received: from mrwall.kal.com [194.193.14.236] (HELO localhost) by mrwall.kal.com (AltaVista Mail V2.0J/2.0J BL25J listener) id 0000_0045_3a5d_ad9a_aeed; Thu, 11 Jan 2001 12:56:58 +0000 Received: from somewhere by smtpxd Message-ID: <3145774E67D8D111BE6E00C0DF418B663AD720@nt.kal.com> From: Dave Berry To: John Max Skaller Cc: Markus Mottl , OCAML Subject: Unicode (was RE: JIT-compilation for OCaml?) Date: Thu, 11 Jan 2001 12:58:53 -0000 MIME-Version: 1.0 X-Mailer: Internet Mail Service (5.0.1460.8) Content-Type: text/plain; charset="iso-8859-1" Sender: weis@pauillac.inria.fr I thought Unicode was a recognised subset of ISO-10646, corresponding to the range 0-2^16. Also, don't Windows NT/2000 use Unicode? My knowledge of C/C++ is probably out of date, but I thought they just used the wide character type, without requiring a particular internal representation. In what way do ISO C/C++ support ISO-10646? (I realise this isn't directly on-topic, but it may be relevant for future extensions to OCaml?) Dave. -----Original Message----- From: John Max Skaller [mailto:skaller@ozemail.com.au] Sent: Thursday, January 11, 2001 7:01 To: Dave Berry Cc: Markus Mottl; OCAML Subject: Re: JIT-compilation for OCaml? Dave Berry wrote: > > This view seems extreme to me. Certainly the Java type system has faults -- > lack of generics being one, lack of enumerated types another, and various > other points as well. But surely Unicode is a useful de facto standard? No. Unicode was abandoned years ago: there is an 'offical' ISO Standard: ISO-10646. There are 2^31 code points, unlike Unicode's 2^16, which is already barely adequate. ISO C and ISO C++ support ISO-10646. Linux runs ISO-10646 (via UTF-8).