From mboxrd@z Thu Jan 1 00:00:00 1970 Received: (from majordomo@localhost) by pauillac.inria.fr (8.7.6/8.7.3) id PAA06028; Wed, 28 Apr 2004 15:04:50 +0200 (MET DST) X-Authentication-Warning: pauillac.inria.fr: majordomo set sender to owner-caml-list@pauillac.inria.fr using -f Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id PAA05977 for ; Wed, 28 Apr 2004 15:04:49 +0200 (MET DST) Received: from aomori.annexia.org (annexia.force9.co.uk [212.56.101.183]) by concorde.inria.fr (8.12.10/8.12.10) with ESMTP id i3SD4mYM021353 for ; Wed, 28 Apr 2004 15:04:48 +0200 Received: from rich by aomori.annexia.org with local (Exim 3.36 #1 (Debian)) id 1BIok4-0005Ss-00 for ; Wed, 28 Apr 2004 14:04:48 +0100 Date: Wed, 28 Apr 2004 14:04:48 +0100 Cc: caml-list@inria.fr Subject: Re: [Caml-list] Re: Common IO structure Message-ID: <20040428130448.GB20907@redhat.com> References: <016401c42bc4$b6438840$19b0e152@warp> <20040428.004358.45522587.yoriyuki@mbg.ocn.ne.jp> <016501c42c73$24e64b30$ef01a8c0@warp> <20040428.015800.126758722.yoriyuki@mbg.ocn.ne.jp> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20040428.015800.126758722.yoriyuki@mbg.ocn.ne.jp> User-Agent: Mutt/1.5.5.1+cvs20040105i From: Richard Jones X-Miltered: at concorde by Joe's j-chkmail ("http://j-chkmail.ensmp.fr")! X-Loop: caml-list@inria.fr X-Spam: no; 0.00; caml-list:01 2004:99 0900,:01 yamagata:01 yoriyuki:01 empirical:01 model:01 encodings:01 stupid:01 obstacle:01 vanier:01 ltd:98 unicode:01 caml:01 bytes:02 Sender: owner-caml-list@pauillac.inria.fr Precedence: bulk On Wed, Apr 28, 2004 at 01:58:00AM +0900, Yamagata Yoriyuki wrote: > I'm interested in an emprical evidence, though. I can't claim empirical evidence, but I do know that UTF-8 input/output is increasingly important to me. We've moved to a model where all the websites we're doing (even for UK customers) are UTF-8 throughout. PostgreSQL is UNICODE, webserver serves UTF-8 pages, all strings manipulated inside the Caml code are really UTF-8 strings (although we rarely deal with them as anything other than pure bytes). The only time we convert to other encodings is for handling stupid mail systems like Hotmail which cannot deal with UTF-8. Rich. -- Richard Jones. http://www.annexia.org/ http://www.j-london.com/ Merjis Ltd. http://www.merjis.com/ - improving website return on investment "One serious obstacle to the adoption of good programming languages is the notion that everything has to be sacrificed for speed. In computer languages as in life, speed kills." -- Mike Vanier ------------------- To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/ Beginner's list: http://groups.yahoo.com/group/ocaml_beginners