From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail4-relais-sop.national.inria.fr (mail4-relais-sop.national.inria.fr [192.134.164.105]) by walapai.inria.fr (8.13.6/8.13.6) with ESMTP id p7PAi68M017315 for ; Thu, 25 Aug 2011 12:44:06 +0200 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AgQBAKkmVk7RVaE2kGdsb2JhbABCFqdjCBQBAQEBCQkNBxQEIYFAAQEBAQEBARICJgYBATcBBAsLDiYSNAEFARwGNYdQAgKdfQqPDgGOOAWFbGCfeTyDZQ X-IronPort-AV: E=Sophos;i="4.68,280,1312149600"; d="scan'208";a="106581651" Received: from mail-fx0-f54.google.com ([209.85.161.54]) by mail4-smtp-sop.national.inria.fr with ESMTP/TLS/RC4-SHA; 25 Aug 2011 12:44:04 +0200 Received: by fxe4 with SMTP id 4so3092368fxe.27 for ; Thu, 25 Aug 2011 03:44:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=subject:mime-version:content-type:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to:x-mailer; bh=3IyyZmYaaDA5ASNfxVsnZ78Vowg4/ExHg+ZILJsSvdg=; b=jv8yWr6rYOEOJt1QNx0LxJKig8K6d5FAOo6uSh8zZ8mDujqdKFKJn23yo/F08qI/g3 P/0Q9HxS7JTaAMToL8q6qiTOfbsQ1G5pgwcNuyXvhUdtS9wd4QlI+5YMzoLcLzj90O5b KptmeO3sr2fkLnNy3o5zG4PCMvMEf7jzEc5p0= Received: by 10.223.6.91 with SMTP id 27mr8987566fay.145.1314269043476; Thu, 25 Aug 2011 03:44:03 -0700 (PDT) Received: from coruscant.kosmos.all (ip-95-223-170-32.unitymediagroup.de [95.223.170.32]) by mx.google.com with ESMTPS id 14sm373639faf.32.2011.08.25.03.44.00 (version=SSLv3 cipher=OTHER); Thu, 25 Aug 2011 03:44:01 -0700 (PDT) Mime-Version: 1.0 (Apple Message framework v1244.3) Content-Type: text/plain; charset=us-ascii From: Benedikt Meurer In-Reply-To: <1314267668.3496.62.camel@thinkpad> Date: Thu, 25 Aug 2011 12:43:59 +0200 Cc: caml-list@inria.fr, Fischbach Marcell Message-Id: <7E896935-C167-48C8-A2DD-BB178D965B19@googlemail.com> References: <93199F3B-E9CF-4D93-9B2B-BAAB03F4FC08@googlemail.com> <4EF51F29-D437-4F6F-9C91-DBEA3D4C3EB8@googlemail.com> <1314218451.3496.42.camel@thinkpad> <11577B43-4270-4BA4-AA77-7FDEFB4B563B@googlemail.com> <1314267668.3496.62.camel@thinkpad> To: Gerd Stolpmann X-Mailer: Apple Mail (2.1244.3) Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by walapai.inria.fr id p7PAi68M017315 Subject: Re: [Caml-list] Linear Scan Register Allocator for ocamlopt/ocamlnat On Aug 25, 2011, at 12:21 , Gerd Stolpmann wrote: >> Reran the benchmark with 32bit ocaml on the MBP (Early 2011), results are available at: >> >> http://ps.informatik.uni-siegen.de/~meurer/tmp/linscan-i7-i386-timings.pdf > > Hard to interpret. I have the impression that the compile times for > graph coloring on i386 are lower than on amd64 - has maybe to do with > the number of registers - but those for linear scanning are about the > same. (I'm comparing here with the amd64 numbers taken on the same > machine.) > > The graphs for the runtimes are on a logarithmic scale. Do you also have > raw numbers? Of course. These are the raw numbers for the i7 (i386): command opt_gc opt_ls opt_gc/ls ---------------------------------------------------------- binarytree 16 1.096 1.102 0.995 fasta 100 0.008 0.008 1.000 fasta 250000 0.044 0.044 1.000 fasta 25000000 2.767 2.737 1.011 mandelbrot 1000 0.224 0.224 1.000 mandelbrot 4000 3.583 3.570 1.004 meteor 2098 0.578 0.586 0.986 nbody 3000000 1.142 1.168 0.978 spectral 3000 4.963 5.243 0.947 almabench 4.665 4.687 0.995 almabench.unsafe 4.629 4.702 0.984 bdd 0.279 0.285 0.979 boyer 0.566 0.572 0.990 fft 0.421 0.422 0.998 nucleic 0.499 0.513 0.973 quicksort 0.150 0.149 1.007 quicksort.unsafe 0.121 0.121 1.000 sorts 1.889 1.947 0.970 And the raw numbers for the P4: command opt_gc opt_ls opt_gc/ls ---------------------------------------------------------- binarytree 16 2.796 2.780 1.006 fasta 100 0.024 0.024 1.000 fasta 250000 0.404 0.408 0.990 fasta 25000000 33.350 33.402 0.998 mandelbrot 1000 0.892 0.884 1.009 mandelbrot 4000 14.220 14.040 1.013 meteor 2098 1.300 1.312 0.991 nbody 3000000 4.524 4.840 0.935 spectral 3000 51.979 53.919 0.964 almabench 17.261 17.721 0.974 almabench.unsafe 17.317 18.441 0.939 bdd 1.256 1.248 1.006 boyer 1.668 1.672 0.998 fft 3.064 3.080 0.995 nucleic 2.188 2.268 0.965 quicksort 0.340 0.340 1.000 quicksort.unsafe 0.272 0.272 1.000 sorts 6.164 6.180 0.997 > Gerd Benedikt