From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on yquem.inria.fr X-Spam-Level: * X-Spam-Status: No, score=1.1 required=5.0 tests=AWL,SPF_NEUTRAL autolearn=disabled version=3.1.3 X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from discorde.inria.fr (discorde.inria.fr [192.93.2.38]) by yquem.inria.fr (Postfix) with ESMTP id F0B59BC69 for ; Tue, 10 Apr 2007 12:08:56 +0200 (CEST) Received: from wr-out-0506.google.com (wr-out-0506.google.com [64.233.184.224]) by discorde.inria.fr (8.13.6/8.13.6) with ESMTP id l3AA8tgL001528 for ; Tue, 10 Apr 2007 12:08:56 +0200 Received: by wr-out-0506.google.com with SMTP id i3so1166885wra for ; Tue, 10 Apr 2007 03:08:55 -0700 (PDT) DKIM-Signature: a=rsa-sha1; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=OWZSBTOorE5du98QpE67KpYfhd5xBuR4UKsus5PCfdrFM52onMbcujrhF3HXuqeGS3gJ/BRZksYIouegKeXRqOSCYClz68e2m8Zuj1lrb0VniqRmcS6QWROE38x9cDQDSNpb4QxcPlGTr/TCieqQCtGjSahlY/TmqrINw7kmHCk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=WnZlIDUe5iByod1gToq0LeMAgbmhk/AngMKQfxPbJtLP7u8eEaLTmm134Gyj3pq4a0KnZZJg5ohNuBTkLfIbzrSFewjbqhI2GnoEP0Q79eKGvL0YVpm8Y/QFg416JTvk3Ygx187V5PcrRl22V/J9VE8gyX57kTP1H1wkNK/hdE8= Received: by 10.114.132.5 with SMTP id f5mr2747960wad.1176199734401; Tue, 10 Apr 2007 03:08:54 -0700 (PDT) Received: by 10.114.211.2 with HTTP; Tue, 10 Apr 2007 03:08:54 -0700 (PDT) Message-ID: <53c655920704100308s567a86f0rf189a8ee4a0a3a61@mail.gmail.com> Date: Tue, 10 Apr 2007 12:08:54 +0200 From: "David Baelde" Reply-To: david.baelde@ens-lyon.org To: "Jacques Garrigue" Subject: Re: [Caml-list] Re: Optimizing Array.blit Cc: caml-list@inria.fr In-Reply-To: <20070410.102630.63130168.garrigue@math.nagoya-u.ac.jp> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <53c655920704041519t77930d38y52003888139b9d2c@mail.gmail.com> <53c655920704091713g6a7b11f4xc975e8118cd35fa1@mail.gmail.com> <20070410.102630.63130168.garrigue@math.nagoya-u.ac.jp> X-j-chkmail-Score: MSGID : 461B6237.000 on discorde : j-chkmail score : X : 0/20 1 0.000 -> 1 X-Miltered: at discorde with ID 461B6237.000 by Joe's j-chkmail (http://j-chkmail . ensmp . fr)! X-Spam: no; 0.00; blit:01 gcc:01 piv:98 0.57:98 0.25:98 0.12:98 caml-list:01 surprising:01 optimizing:02 optimizing:02 fold:06 processor:06 quite:07 generic:08 loaded:08 Hi, Thanks a lot for that advice, I was stupid not to try that before. I got a 4 fold improvement too on my PIV desktop machine, which brings the specialized ML code about as fast as the C code. This does not change when I pass -O3 to gcc. However, I ran the test on a dedibox, which has a VIA processor. The machine is loaded so it's difficult to conclude anything, but I get quite always the same surprising figures: without -O3 I get 0.57 for the C code, 0.91 for the generic ML and 0.25 for the specialized ML; with -O3 I get 0.12 for the C code, same for the others. Optimizing is boring... -- David