From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail1-relais-roc.national.inria.fr (mail1-relais-roc.national.inria.fr [192.134.164.82]) by walapai.inria.fr (8.13.6/8.13.6) with ESMTP id q0DH6pAb000754 for ; Fri, 13 Jan 2012 18:06:51 +0100 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ApACAN5jEE/U436rk2dsb2JhbABChRCoDCIBAQEBCQkLCRQDIoFyAQEEASNWBQsLGgImAgJXBhMJh3ECBqVlkTiBL4lYgRYEjSWNPoxv X-IronPort-AV: E=Sophos;i="4.71,505,1320620400"; d="scan'208";a="139194465" Received: from moutng.kundenserver.de ([212.227.126.171]) by mail1-smtp-roc.national.inria.fr with ESMTP; 13 Jan 2012 18:06:46 +0100 Received: from office1.lan.sumadev.de (dslb-094-219-216-241.pools.arcor-ip.net [94.219.216.241]) by mrelayeu.kundenserver.de (node=mreu1) with ESMTP (Nemesis) id 0MLTGA-1RmIop3OVu-0001xG; Fri, 13 Jan 2012 18:06:46 +0100 Received: from [192.168.5.106] (dslb-094-219-216-241.pools.arcor-ip.net [94.219.216.241]) by office1.lan.sumadev.de (Postfix) with ESMTPSA id 8DF93C00CE; Fri, 13 Jan 2012 18:06:45 +0100 (CET) From: Gerd Stolpmann To: Anders Fugmann Cc: caml-list In-Reply-To: <4F105C34.3010207@fugmann.net> References: <4F105C34.3010207@fugmann.net> Content-Type: text/plain; charset="UTF-8" Date: Fri, 13 Jan 2012 18:06:50 +0100 Message-ID: <1326474410.14288.116.camel@thinkpad> Mime-Version: 1.0 X-Mailer: Evolution 2.28.3 Content-Transfer-Encoding: 7bit X-Provags-ID: V02:K0:ir06gHdMVgEnT6vEBnvR6azIF7JtNuJWlJ2uMIq4qHW JHUXJwynAVvTBn1HJzj9wS2RdhyYXpAbrIekKsjQ0O/pHiI001 /NUZb20a0WKoOfGfyTA2eDF/GO0OmBSfyPnymaz2hSuj2DMgW/ L2gDEsUskHFT5jpvfIjo9QAbC+IEzV75Crz9E+kCPjPr58vymT MCfNAk/CsVZmD9/z6UQ1LQKrMdJ242CcVQj9PEBqoLUSA44xa4 hkNwqFpb+J53Nr0KrLmLK61Tzjn2paUFBTBiegcLcnZE+q3jQk oNWpqrLrIKwjRQdevmBEs+5GOlpL6qNrf4wmD82000vn/jKryi K0b50y/0dL1Uv9g45iaMWp6V3AupyQu6mlMph/BHN Subject: Re: [Caml-list] Netmcore problems. Am Freitag, den 13.01.2012, 17:30 +0100 schrieb Anders Fugmann: > Hi, > > I'm having a problem using netmcore from ocamlnet 3.4 library. > > When I try to place a large data structure into Netmcore_array, the > controlling process dies with the following error: > > [Fri Jan 13 16:21:38 2012] [netplex.controller] [alert] Process 28067 > for service netmcore_0 terminated with signal 7 That's sigbus. You get it when the OS does not have memory anymore - but not when the memory is allocated, but first when it is filled with data. You can change this by disallowing that memory is overcommitted (set the kernel param vm.overcommit_memory to 1 or even 2). > The error occurs while trying to initialise a Netmcore_array from an > array of 4*10^6 strings all of length 150. This is done in the context > of the "first" process. > > The controlling process initializes a memory pool by using > > let pool_id = Netmcore_mempool.create_mempool (1024 * 1024 * 1024 * > 10) in > > Running the code on a smaller dataset seems to work. > > I have increased kernel variables shmall and shmmax to very high numbers > (100Gb), but it does not solve the problem. No, this cannot solve it. These variables only control System V shared memory, but Netmcore uses POSIX shared memory. On Linux, you can change the max of this memory by re-mounting /dev/shm, e.g. mount -o remount,size=80% /dev/shm The default is 50% of available RAM. > All ideas are welcome. > > On other question while I'm at it; am I allowed to create multiple > shared arrays from the same memory pool, or do I need to create one pool > for each shared array? The pools can be shared. Gerd > > Regards > Anders Fugmann > > > -- ------------------------------------------------------------ Gerd Stolpmann, Darmstadt, Germany gerd@gerd-stolpmann.de Creator of GODI and camlcity.org. Contact details: http://www.camlcity.org/contact.html Company homepage: http://www.gerd-stolpmann.de *** Searching for new projects! Need consulting for system *** programming in Ocaml? Gerd Stolpmann can help you. ------------------------------------------------------------