From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on yquem.inria.fr X-Spam-Level: X-Spam-Status: No, score=0.0 required=5.0 tests=none autolearn=disabled version=3.1.3 X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from mail3-relais-sop.national.inria.fr (mail3-relais-sop.national.inria.fr [192.134.164.104]) by yquem.inria.fr (Postfix) with ESMTP id 8FC81BC6B for ; Thu, 11 Oct 2007 23:19:04 +0200 (CEST) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AgAAAFAvDkfAXQInh2dsb2JhbACOSgEBAQgKKQ X-IronPort-AV: E=Sophos;i="4.21,261,1188770400"; d="scan'208";a="4384946" Received: from concorde.inria.fr ([192.93.2.39]) by mail3-smtp-sop.national.inria.fr with ESMTP; 11 Oct 2007 23:19:04 +0200 Received: from mail1-relais-roc.national.inria.fr (mail1-relais-roc.national.inria.fr [192.134.164.82]) by concorde.inria.fr (8.13.6/8.13.6) with ESMTP id l9BLJ31X028203 (version=TLSv1/SSLv3 cipher=RC4-SHA bits=128 verify=OK) for ; Thu, 11 Oct 2007 23:19:04 +0200 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AgAAAFAvDkfUGyodimdsb2JhbACOSgEBAQgEBiIH X-IronPort-AV: E=Sophos;i="4.21,261,1188770400"; d="scan'208";a="2801730" Received: from smtp3-g19.free.fr ([212.27.42.29]) by mail1-smtp-roc.national.inria.fr with ESMTP; 11 Oct 2007 23:19:03 +0200 Received: from smtp3-g19.free.fr (localhost.localdomain [127.0.0.1]) by smtp3-g19.free.fr (Postfix) with ESMTP id 6DD7817B54F; Thu, 11 Oct 2007 23:19:03 +0200 (CEST) Received: from [192.168.0.4] (rke75-3-82-229-183-156.fbx.proxad.net [82.229.183.156]) by smtp3-g19.free.fr (Postfix) with ESMTP id 192E317B551; Thu, 11 Oct 2007 23:19:02 +0200 (CEST) Message-ID: <470E92B2.2040601@frisch.fr> Date: Thu, 11 Oct 2007 23:16:34 +0200 From: Alain Frisch User-Agent: Thunderbird 2.0.0.6 (Windows/20070728) MIME-Version: 1.0 To: skaller Cc: Caml-list ml Subject: Re: [Caml-list] Re: Rope is the new string References: <20071009.122004.251675959.Christophe.Troestler+ocaml@umh.ac.be> <200710091440.48381.jon@ffconsultancy.com> <20071009155702.GB26282@snarc.org> <6f9f8f4a0710090942u2afe6c5erc2d5b11ecfff2253@mail.gmail.com> <20071009165520.GA27507@snarc.org> <6f9f8f4a0710091032r3dace80bi4f9b584ae5056675@mail.gmail.com> <20071009195119.GA29263@snarc.org> <875c7e070710091504j203ff78y8703fc5c22086671@mail.gmail.com> <20071011130354.GA7016@snarc.org> <1192110864.6045.12.camel@rosella.wigram> <20071011142141.GA8001@snarc.org> <1192114097.6184.7.camel@rosella.wigram> In-Reply-To: <1192114097.6184.7.camel@rosella.wigram> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Miltered: at concorde with ID 470E9347.000 by Joe's j-chkmail (http://j-chkmail . ensmp . fr)! X-Spam: no; 0.00; frisch:01 frisch:01 ffff:98 wrote:01 caml-list:01 int:01 alain:01 alain:01 string:02 bits:05 bits:05 converting:05 routines:06 machine:09 actually:10 skaller wrote: > In that case, you can use an int Array.t for Unicode provided > it is only 31 bit OR you have a 64 bit machine. These routines > should help converting to and from UTF-8: Unicode code points will always fit in 31 bits and 23 bits are actually enough for now (the largest valid assigned value is 0x10ffff). -- Alain