From mboxrd@z Thu Jan 1 00:00:00 1970 Received: (from majordomo@localhost) by pauillac.inria.fr (8.7.6/8.7.3) id XAA12831; Fri, 18 Jun 2004 23:39:56 +0200 (MET DST) X-Authentication-Warning: pauillac.inria.fr: majordomo set sender to owner-caml-list@pauillac.inria.fr using -f Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id XAA12820 for ; Fri, 18 Jun 2004 23:39:53 +0200 (MET DST) X-SPAM-Warning: Sending machine is listed in blackholes.five-ten-sg.com Received: from eposta.kablonet.com.tr ([62.248.102.66]) by concorde.inria.fr (8.12.10/8.12.10) with SMTP id i5ILdpSH030680 for ; Fri, 18 Jun 2004 23:39:52 +0200 Received: (qmail 87628 invoked by uid 1007); 18 Jun 2004 21:48:41 -0000 Received: from exa@kablonet.com.tr by eposta.kablonet.com.tr by uid 0 with qmail-scanner-1.21 (clamdscan: 0.70-rc. Clear:RC:0(81.214.24.132):. Processed in 0.574729 secs); 18 Jun 2004 21:48:41 -0000 Received: from unknown (HELO orion) (exa@kablonet.com.tr@81.214.24.132) by 0 with SMTP; 18 Jun 2004 21:48:40 -0000 From: Eray Ozkural Reply-To: erayo@cs.bilkent.edu.tr Organization: Bilkent University CS Dept. To: Brian Hurt Subject: Re: [Caml-list] Great Programming Language Shootout Revived Date: Sat, 19 Jun 2004 00:39:44 +0300 User-Agent: KMail/1.6.51 Cc: Sebastien Ferre , Ocaml Mailing List References: In-Reply-To: MIME-Version: 1.0 Content-Disposition: inline Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Message-Id: <200406190039.44804.exa@kablonet.com.tr> X-Miltered: at concorde with ID 40D36127.000 by Joe's j-chkmail (http://j-chkmail.ensmp.fr)! X-Loop: caml-list@inria.fr X-Spam: no; 0.00; eray:01 ozkural:01 caml-list:01 shootout:01 2004:99 indentation:01 disadvantage:01 rank:99 newlines:01 indenting:01 shorter:01 eray:01 ozkural:01 erayo:01 bilkent:01 Sender: owner-caml-list@pauillac.inria.fr Precedence: bulk On Friday 18 June 2004 18:45, Brian Hurt wrote: > On Fri, 18 Jun 2004, Sebastien Ferre wrote: > > What about counting characters instead (counting several spaces in > > a row as only one character) ? As most languages just don't mind > > where you cut the lines, this seems to me a more fair measure. > > Also, this would give complete freedom on the indentation style > > without resulting in any disadvantage for the language rank. > > > > Just a suggestion :) > > Actually, the idea I like is to pick a standard compression algorithm- say > gzip, or bzip2- and compress the files. This actually gives you a decent > approximation of the amount of information the soure code contains- and > entire sidesteps a lot of issues. In addition to newlines and indenting, > it also minimizes the effect of variable name length. For example, the > average name length for ocaml programs is probably a lot shorter than the > average name length for java programs. A note. This is an approximation of kolmogorov complexity which is regarded as a universal measure of information content. In recent research, it was observed that statistical compressors (PPM) do a little better than bzip2 in terms of discriminatory power (ie. information distance). Best Regards, -- Eray Ozkural (exa) Comp. Sci. Dept., Bilkent University, Ankara KDE Project: http://www.kde.org http://www.cs.bilkent.edu.tr/~erayo Malfunction: http://malfunct.iuma.com GPG public key fingerprint: 360C 852F 88B0 A745 F31B EA0F 7C07 AE16 874D 539C ------------------- To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/ Beginner's list: http://groups.yahoo.com/group/ocaml_beginners