From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on yquem.inria.fr X-Spam-Level: * X-Spam-Status: No, score=1.5 required=5.0 tests=AWL,HTML_10_20,HTML_MESSAGE, SPF_NEUTRAL autolearn=disabled version=3.1.3 X-Original-To: caml-list@yquem.inria.fr Delivered-To: caml-list@yquem.inria.fr Received: from mail4-relais-sop.national.inria.fr (mail4-relais-sop.national.inria.fr [192.134.164.105]) by yquem.inria.fr (Postfix) with ESMTP id 02A96BC69 for ; Sun, 30 Sep 2007 05:22:14 +0200 (CEST) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AgAAADiz/kbRVYC/kmdsb2JhbACCOjeLQgEBAQEHAgYRFg X-IronPort-AV: E=Sophos;i="4.21,213,1188770400"; d="scan'208";a="17015885" Received: from fk-out-0910.google.com ([209.85.128.191]) by mail4-smtp-sop.national.inria.fr with ESMTP; 30 Sep 2007 05:22:14 +0200 Received: by fk-out-0910.google.com with SMTP id b27so4057619fka for ; Sat, 29 Sep 2007 20:22:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:mime-version:content-type; bh=MsHbOTzCSvY1eXdoVUkX77enypRzP8vbx/nZFazRK5Y=; b=WTWHebDdBJACblVR5pHNCC+56hxBIktVDDlRC2mWXQj6yVuv5L2Lul9RxdGC6YuSMGiIviAX8DDcDUgIbXS54zL0YEaX9kRjj3NlFCQLqnGoKUQPgOU1jStf0hvAZHsXjymLqVEthiB8Ck/1BYZdtymzAb0xsH2Tzc+34BoOUyk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:mime-version:content-type; b=YFWEAcXZ9q2QTF8qV3yD+o9eNhWvO6+gU+oWQYHjULPYCOx4wni+u951o0daw/Y8BLegjaQqOJ+kjga0GZVz9s4uWqvqueD7w0wKahKxLMKKqtHcbUmMK5N5oq/5EPs8Rh7scGQ2OYUqGeMrCt+GbFSTgWrb13NVz6ijTXqpxhc= Received: by 10.82.165.13 with SMTP id n13mr3621169bue.1191122532928; Sat, 29 Sep 2007 20:22:12 -0700 (PDT) Received: by 10.82.138.2 with HTTP; Sat, 29 Sep 2007 20:22:12 -0700 (PDT) Message-ID: <164378d10709292022y54832f37we542a09ba6366944@mail.gmail.com> Date: Sun, 30 Sep 2007 00:22:12 -0300 From: Rod To: caml-list@yquem.inria.fr, "Andres Varon" Subject: (help) new kind of error <-> OcamlMPI MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_2391_19054036.1191122532924" X-Spam: no; 0.00; ocamlmpi:01 ocamlmpi:01 ocamlc:01 usr:01 lib:01 usr:01 lib:01 ocaml:01 ocamlc:01 ocaml:01 runtime:01 runtime:01 mate:98 1.01:98 1.01:98 ------=_Part_2391_19054036.1191122532924 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi there!! Here we go! Well.. I'm still having some troubles with OcamlMPI. Mr. Andres Varon, I just tried to compile test.ml file (i.e.: regression test), which is available on the latest release provided by Mr. Xavier Leroy, by doing what you suggested me to do. Unfortunately, it still didn't work. Thanks anyway, mate! So, after some time working on that issue, I've eventually got it compiled by doing the following: [rodiney@homer ocamlmpi-1.01]$ ocamlc -o testmpi mpi.cma unix.cma test.ml-ccopt -L/usr/lib/lam -ccopt -L/usr/lib/ocaml/ocamlmpi -ccopt -llam -ccopt -lutil Here is the output: [rodiney@homer ocamlmpi-1.01]$ ocamlc -o testmpi mpi.cma unix.cma test.ml-ccopt -L/usr/lib/lam -ccopt -L/usr/lib/ocaml/ocamlmpi -ccopt -llam -ccopt -lutil File "test.ml", line 144, characters 6-9: Warning Y: unused variable res. [rodiney@homer ocamlmpi-1.01]$ As you all can see, there is only a warning message. I mean.. no big deal. So, that command line works pretty good. The problem occurs when I try to execute 'testmpi' file. I'm getting the following message when doing this: rodiney@homer ocamlmpi-1.01]$ mpirun -np 1 ./testmpi --------------------------------------------------------------------------------------- It seems that there is no lamd running on the host homer. This indicates that the LAM/MPI runtime environment is not operating. The LAM/MPI runtime environment is necessary for MPI programs to run (the MPI program tired to invoke the "MPI_Init" function). Please run the "lamboot" command the start the LAM/MPI runtime environment. See the LAM/MPI documentation for how to invoke "lamboot" across multiple machines. --------------------------------------------------------------------------------------- The same message appears by doing 'mpiexec -n 1 ./testmpi' as well. I would like to remember you, guys, that I'm using MPICH2 implementation of MPI (mpich2-1.0.6). So, although MPICH2 is already installed and configured properly, the message displayed above is just asking to start 'lamboot'. However, 'lamboot' is a tool provided by LAM/MPI: yet another an MPI implementation. Why it is asking me for 'lamboot' if I'm using the 'mpd' daemon (i.e.: the process manager) from MPICH2?? Hmm. OK.. I've just started 'lamboot' and I did 'mpiexec -n 1 ./testmp' (or 'mpirun -np 1 ./testmpi') once more. Below is the output: [rodiney@homer ocamlmpi-1.01]$ mpiexec -n 1 ./testmpi MPI process rank 0 (n0, p3915) caught a SIGSEGV in MPI_Errhandler_free. Rank (0, MPI_COMM_WORLD): Call stack within LAM: Rank (0, MPI_COMM_WORLD): - MPI_Errhandler_free() Rank (0, MPI_COMM_WORLD): - MPI_Errhandler_set() Rank (0, MPI_COMM_WORLD): - main() So, that's the point. What should I do to solve this problem and get 'testmpi' working?? Please, help me!!! I'm almost loosing my mind!! LOL I'm looking forward for any help!!! Thanks in advance and sorry for such a long message. That's it!! Best regards, Rodiney ------=_Part_2391_19054036.1191122532924 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi there!! Here we go!
Well.. I'm still having some troubles with OcamlMPI.
Mr. Andres Varon, I just tried to compile test.ml file (i.e.: regression test), which is available on the latest release provided by Mr. Xavier Leroy, by doing what you suggested me to do. Unfortunately, it still didn't work. Thanks anyway, mate!

So, after some time working on that issue, I've eventually got it compiled by doing the following:

[rodiney@homer ocamlmpi-1.01]$ ocamlc -o testmpi mpi.cma unix.cma test.ml -ccopt -L/usr/lib/lam -ccopt -L/usr/lib/ocaml/ocamlmpi -ccopt -llam -ccopt -lutil

Here is the output:

[rodiney@homer ocamlmpi-1.01]$ ocamlc -o testmpi mpi.cma unix.cma test.ml -ccopt -L/usr/lib/lam -ccopt -L/usr/lib/ocaml/ocamlmpi -ccopt -llam -ccopt -lutil
File "test.ml", line 144, characters 6-9:
Warning Y: unused variable res.
[rodiney@homer ocamlmpi-1.01]$

As you all can see, there is only a warning message. I mean.. no big deal. So, that command line works pretty good.
The problem occurs when I try to execute 'testmpi' file.
I'm  getting the following message when doing this:

rodiney@homer ocamlmpi-1.01]$ mpirun -np 1 ./testmpi
---------------------------------------------------------------------------------------
It seems that there is no lamd running on the host homer.

This indicates that the LAM/MPI runtime environment is not operating.
The LAM/MPI runtime environment is necessary for MPI programs to run
(the MPI program tired to invoke the "MPI_Init" function).

Please run the "lamboot" command the start the LAM/MPI runtime
environment.  See the LAM/MPI documentation for how to invoke
"lamboot" across multiple machines.
---------------------------------------------------------------------------------------

The same message appears by doing 'mpiexec -n 1 ./testmpi' as well.

I would like to remember you, guys, that I'm using MPICH2 implementation of MPI (mpich2-1.0.6).
So, although MPICH2 is already installed and configured properly, the message displayed above is just asking to start 'lamboot'. However, 'lamboot' is a tool provided by LAM/MPI: yet another an MPI implementation. Why it is asking me for 'lamboot' if I'm using the 'mpd' daemon (i.e.: the process manager) from MPICH2??

Hmm. OK.. I've just started 'lamboot' and I did 'mpiexec -n 1 ./testmp' (or 'mpirun -np 1 ./testmpi') once more. Below is the output:

[rodiney@homer ocamlmpi-1.01]$ mpiexec -n 1 ./testmpi
MPI process rank 0 (n0, p3915) caught a SIGSEGV in MPI_Errhandler_free.
Rank (0, MPI_COMM_WORLD): Call stack within LAM:
Rank (0, MPI_COMM_WORLD):  - MPI_Errhandler_free()
Rank (0, MPI_COMM_WORLD):  - MPI_Errhandler_set()
Rank (0, MPI_COMM_WORLD):  - main()

So, that's the point. What should I do to solve this problem and get 'testmpi' working??
Please, help me!!! I'm almost loosing my mind!! LOL
I'm looking forward for any help!!!
Thanks in advance and sorry for such a long message. That's it!!

Best regards,

                    Rodiney ------=_Part_2391_19054036.1191122532924--