From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on yquem.inria.fr X-Spam-Level: X-Spam-Status: No, score=0.3 required=5.0 tests=AWL autolearn=disabled version=3.1.3 X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from mail4-relais-sop.national.inria.fr (mail4-relais-sop.national.inria.fr [192.134.164.105]) by yquem.inria.fr (Postfix) with ESMTP id 0D0F2BB84 for ; Tue, 9 Sep 2008 07:31:54 +0200 (CEST) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ApgBAHymxUjRVcbhlGdsb2JhbACSDD4BAQEBCQMKBxEDm0CGTwEChAY X-IronPort-AV: E=Sophos;i="4.32,363,1217800800"; d="scan'208";a="28958756" Received: from rv-out-0506.google.com ([209.85.198.225]) by mail4-smtp-sop.national.inria.fr with ESMTP; 09 Sep 2008 07:31:52 +0200 Received: by rv-out-0506.google.com with SMTP id f6so1605031rvb.3 for ; Mon, 08 Sep 2008 22:31:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:cc:in-reply-to:mime-version:content-type :content-transfer-encoding:content-disposition:references; bh=nPKqI0vguTwX+7ENhxvF64ELK9yuWZ4H/NSeUV+9q3s=; b=D2vClONRuM61renFB84ugKJdaHh80Fop/Zp8qC1ifTaCvxSNMS7eKBW+rpwsB5m3cY 4FWJeA+0NmyDaYs6+mNk0KHHhKI8Ub1ROTdjI3yY5uHexqPr/hm5GToxVLZoRPAuGdB8 T6NAmQXRuIRGPMYWvNv2DZ8WTPuWE+74DzLBM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:cc:in-reply-to:mime-version :content-type:content-transfer-encoding:content-disposition :references; b=Ye+33txuFahXeIaLLsRedBRrwg18Fj5tS2RQnie2OSUFHvy+F8C4HnfXARa4uCCGXA g3J9XG7KeBAn30hlNQhosmE6Pyrp+e3njAF9OHlTbqUQBxbt9PJRXjFlJ61tEbeGB59J 2dYHLn611gFdeAj5CeXSXD5GSUm6fwvQaNXbQ= Received: by 10.141.170.9 with SMTP id x9mr9440942rvo.90.1220938311202; Mon, 08 Sep 2008 22:31:51 -0700 (PDT) Received: by 10.141.5.12 with HTTP; Mon, 8 Sep 2008 22:31:51 -0700 (PDT) Message-ID: Date: Mon, 8 Sep 2008 22:31:51 -0700 From: "Nathaniel Gray" To: "Jon Harrop" Subject: Re: [Caml-list] New Ocaml Plug-in for NetBeans Cc: "Maxence Guesdon" , caml-list@yquem.inria.fr In-Reply-To: <200809080210.57000.jon@ffconsultancy.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <1217061979.488ae45b452b9@webmail.inescporto.pt> <200808201732.07275.jon@ffconsultancy.com> <200809080210.57000.jon@ffconsultancy.com> X-Spam: no; 0.00; ocaml:01 ocaml:01 cheers:01 plug-in:98 token:01 wrote:01 wrote:01 caml-list:01 string:02 optimizing:03 gzip:04 gzip:04 caltech:05 caltech:05 exists:05 On Sun, Sep 7, 2008 at 6:10 PM, Jon Harrop wrote: > On Monday 08 September 2008 00:31:47 Nathaniel Gray wrote: >> In fact, gzip does a pretty fine job of optimizing .annot files. The >> source file this .annot came from is 15K, the .annot file is 78K, and >> gzipping it gets it down to 9K. > > Tokenizing that .annot format by simply mapping each string onto a unique > token and writing out the mapping first will shrink such files enormously > whilst still being easily dissected from any language. Moreover, the benefit > is super-linear... Sure, I had the same thought, but gzip already exists, doesn't require approval from INRIA, and .gz is easily read in just about any language (including OCaml). Once the .annot files are smaller than the source files I'm completely willing to generate them all the time. Cheers, -n8 -- >>>-- Nathaniel Gray -- Caltech Computer Science ------> >>>-- Mojave Project -- http://mojave.cs.caltech.edu -->