From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.sysutils.supervision.general/1470 Path: news.gmane.org!not-for-mail From: Charlie Brady Newsgroups: gmane.comp.sysutils.supervision.general Subject: Re: runit not collecting zombies Date: Sat, 7 Jul 2007 14:13:48 -0400 (EDT) Message-ID: References: <20070603111056.15978.qmail@3deb4a0e5d8414.315fe32.mid.smarden.org> <20070611131112.GA1576@home.power> <20070618134516.GA1560@home.power> <20070619181325.23252.qmail@a92f927aabd53f.315fe32.mid.smarden.org> <20070619190751.GC27090@home.power> <20070620162325.26345.qmail@7d91355cde742c.315fe32.mid.smarden.org> <20070620165736.GC12963@home.power> <20070620183532.4571.qmail@9f638fd8b69905.315fe32.mid.smarden.org> <20070623044205.GA1594@home.power> <20070626095920.6195.qmail@3e147d410b1c2c.315fe32.mid.smarden.org> <20070707071657.GA1517@home.power> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-Trace: sea.gmane.org 1183832033 26090 80.91.229.12 (7 Jul 2007 18:13:53 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 7 Jul 2007 18:13:53 +0000 (UTC) Cc: supervision@list.skarnet.org To: Alex Efros Original-X-From: supervision-return-1707-gcsg-supervision=m.gmane.org@list.skarnet.org Sat Jul 07 20:13:52 2007 connect(): Connection refused Return-path: Envelope-to: gcsg-supervision@gmane.org Original-Received: from antah.skarnet.org ([212.85.147.14]) by lo.gmane.org with smtp (Exim 4.50) id 1I7En1-00052M-M3 for gcsg-supervision@gmane.org; Sat, 07 Jul 2007 20:13:51 +0200 Original-Received: (qmail 3212 invoked by uid 76); 7 Jul 2007 18:14:12 -0000 Mailing-List: contact supervision-help@list.skarnet.org; run by ezmlm List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Archive: Original-Received: (qmail 3206 invoked from network); 7 Jul 2007 18:14:12 -0000 X-X-Sender: charlieb@e-smith.charlieb.ott.istop.com In-Reply-To: <20070707071657.GA1517@home.power> Xref: news.gmane.org gmane.comp.sysutils.supervision.general:1470 Archived-At: On Sat, 7 Jul 2007, Alex Efros wrote: > On Tue, Jun 26, 2007 at 09:59:20AM +0000, Gerrit Pape wrote: >> But I may be blind, let's see whether reaping zombies at least every 5 >> seconds helps. > > One difference I already noticed. Every 1 minute cron run several `chpst -L` > like this one: Why? That looks like a very strange thing to do. > */1 * * * * ( cd /var/www/soft.p/html && exec chpst -L .lib/var/.lock.service runsvdir .lib/service/ &>/dev/null ) & So every minute cron will run a shell script, and then wait for it to finish. Each shell script forks a subshell in the background and then exits, so cron no longer waits. The subshell is reparented to process 1. When it exits, it will become a zombie until process 1 reaps its status. Why are you running the subshell? Why do you background it? Why are you throwing away any error output from chpst/runsvdir? What happens if your cron line is: */1 * * * * chpst -L /var/www/soft.p/html/.lib/var/.lock.service runsvdir /var/www/soft.p/html/.lib/var/.lib/service/ ? Why are you starting a new runsvdir every minute? What are you actually trying to achieve? - perhaps someone can suggest a less "unusual" design.