From: Florian Hars
Date: Thu, 13 Aug 2009 08:10:00 +0200
To: Dario Teixeira
Cc: caml-list@yquem.inria.fr
Subject: Re: [Caml-list] Storing UTF-8 in plain strings

Dario Teixeira wrote:
> So, can someone find any problems with this reasoning?

No, the kind of compatibility with legacy code you described is one of the
original design goals of UTF-8; see
http://www.cl.cam.ac.uk/~mgk25/ucs/utf-8-history.txt

- Florian.