From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from mail2-relais-roc.national.inria.fr (mail2-relais-roc.national.inria.fr [192.134.164.83]) by yquem.inria.fr (Postfix) with ESMTP id A6DF6BC57 for ; Mon, 6 Sep 2010 12:52:05 +0200 (CEST) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ApwEACdlhExQW+UMgWdsb2JhbACTZ40rFQEBFiIiu1aFPQSIb4Ep X-IronPort-AV: E=Sophos;i="4.56,324,1280700000"; d="scan'208";a="58680288" Received: from lo.gmane.org ([80.91.229.12]) by mail2-smtp-roc.national.inria.fr with ESMTP; 06 Sep 2010 12:51:47 +0200 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1OsZIg-0008TB-QM for caml-list@inria.fr; Mon, 06 Sep 2010 12:51:46 +0200 Received: from ks368928.kimsufi.com ([94.23.39.26]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 06 Sep 2010 12:51:46 +0200 Received: from sylvain by ks368928.kimsufi.com with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 06 Sep 2010 12:51:46 +0200 X-Injected-Via-Gmane: http://gmane.org/ To: caml-list@inria.fr From: Sylvain Le Gall Subject: Re: ocamlmpi reduce_int_array, showstopper? Date: Mon, 6 Sep 2010 10:51:37 +0000 (UTC) Message-ID: References: X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: ks368928.kimsufi.com User-Agent: slrn/pre1.0.0-11 (Linux) X-Spam: no; 0.00; le-gall:01 ocamlmpi:01 showstopper:01 mikelin:01 ocamlmpi:01 trivial:01 mikelin:01 uname:01 usr:01 bug:01 commited:01 2009:98 1.2:98 wrote:01 integer:01 On 04-09-2010, Mike Lin wrote: > Hi Sylvain, Xavier, > I have encountered what seems to be a serious problem with > reduce_int_array in ocamlmpi, namely, in a trivial test case it > successfully returns incorrect results. This is my test code > [...] > > [mikelin@honest3 ~/]$ uname -a > Linux honest3.ncsa.uiuc.edu 2.6.18-92.1.10.el5_lustre.1.6.6smp-perfctr > #7 SMP Tue Nov 10 10:41:00 CST 2009 x86_64 x86_64 x86_64 GNU/Linux > [mikelin@honest3 ~/]$ which mpiexec > /usr/local/mvapich2-1.2-intel-ofed-1.2.5.5/bin/mpiexec > > see also: http://www.ncsa.illinois.edu/UserInfo/Resources/Hardware/Intel64Cluster/ > > I know the ocamlmpi code has been stable for some time, but I wonder > if it's been tested on x86_64? Let me know if you're able to reproduce > this. > I can confirm the bug. I have commited a fix in the svn. Though, I am not an MPI expert. I think the problem was related to a double in-place re-encoding of integer in the array. I'll wait for Xavier review to do a release. Regards, Sylvain Le Gall