From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.sysutils.supervision.general/1485 Path: news.gmane.org!not-for-mail From: "George Georgalis" Newsgroups: gmane.comp.sysutils.supervision.general Subject: Re: runit not collecting zombies Date: Sun, 15 Jul 2007 16:18:46 -0400 Message-ID: <20070715201846.GT3925@run.galis.org> References: <20070618134516.GA1560@home.power> <20070619181325.23252.qmail@a92f927aabd53f.315fe32.mid.smarden.org> <20070619190751.GC27090@home.power> <20070620162325.26345.qmail@7d91355cde742c.315fe32.mid.smarden.org> <20070620165736.GC12963@home.power> <20070620183532.4571.qmail@9f638fd8b69905.315fe32.mid.smarden.org> <20070623044205.GA1594@home.power> <20070626095920.6195.qmail@3e147d410b1c2c.315fe32.mid.smarden.org> <20070715144704.GS23517@home.power> <20070715190757.GW23517@home.power> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1184530729 1774 80.91.229.12 (15 Jul 2007 20:18:49 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sun, 15 Jul 2007 20:18:49 +0000 (UTC) To: supervision@list.skarnet.org Original-X-From: supervision-return-1722-gcsg-supervision=m.gmane.org@list.skarnet.org Sun Jul 15 22:18:48 2007 Return-path: Envelope-to: gcsg-supervision@gmane.org Original-Received: from antah.skarnet.org ([212.85.147.14]) by lo.gmane.org with smtp (Exim 4.50) id 1IAAYK-0006l5-Iw for gcsg-supervision@gmane.org; Sun, 15 Jul 2007 22:18:48 +0200 Original-Received: (qmail 4678 invoked by uid 76); 15 Jul 2007 20:19:09 -0000 Mailing-List: contact supervision-help@list.skarnet.org; run by ezmlm List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Archive: Original-Received: (qmail 4673 invoked from network); 15 Jul 2007 20:19:09 -0000 Mail-Followup-To: supervision@list.skarnet.org Content-Disposition: inline In-Reply-To: <20070715190757.GW23517@home.power> Xref: news.gmane.org gmane.comp.sysutils.supervision.general:1485 Archived-At: On Sun, Jul 15, 2007 at 10:07:57PM +0300, Alex Efros wrote: >Hi! > ># date; ps ax | grep Z | wc >Sun Jul 15 19:00:29 GMT 2007 > 371 2227 16523 > ># ps -ef ax | grep perl | grep Z >root 9072 1 0 18:58 pts/1 Z 0:00 [perl] >root 9094 1 0 18:59 pts/1 Z 0:00 [perl] >root 9183 1 0 18:59 pts/1 Z 0:00 [perl] >root 9192 1 0 18:59 pts/1 Z 0:00 [perl] >root 9261 1 0 18:59 pts/1 Z 0:00 [perl] >root 9267 1 0 18:59 pts/1 Z 0:00 [perl] >root 9273 1 0 18:59 pts/1 Z 0:00 [perl] > ># perl -e 'fork && exit; sleep 1; print "$$ done\n"' ># 9392 done > ># date; ps ax | grep Z | wc >Sun Jul 15 19:01:12 GMT 2007 > 372 2233 16567 > ># ps -ef ax | grep perl | grep Z >root 9072 1 0 18:58 pts/1 Z 0:00 [perl] >root 9094 1 0 18:59 pts/1 Z 0:00 [perl] >root 9183 1 0 18:59 pts/1 Z 0:00 [perl] >root 9192 1 0 18:59 pts/1 Z 0:00 [perl] >root 9261 1 0 18:59 pts/1 Z 0:00 [perl] >root 9267 1 0 18:59 pts/1 Z 0:00 [perl] >root 9273 1 0 18:59 pts/1 Z 0:00 [perl] >root 9392 1 0 19:01 pts/1 Z 0:00 [perl] > >Can anybody help me debug this issue? >I've attached tar file with contents of /proc/9392/, maybe this helps. try using lsof to determine what file descriptors are open and focus on attaching them somewhere ie /dev/null when you fork the process. Your defunct perl process are probably waiting for EOF from the fork. maybe you could close stdout/stderr of the fork? // George -- George Georgalis, information systems scientist <