From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.sysutils.supervision.general/1469 Path: news.gmane.org!not-for-mail From: Alex Efros Newsgroups: gmane.comp.sysutils.supervision.general Subject: Re: runit not collecting zombies Date: Sat, 7 Jul 2007 10:16:57 +0300 Organization: asdfGroup Inc., http://powerman.asdfGroup.com/ Message-ID: <20070707071657.GA1517@home.power> References: <20070603111056.15978.qmail@3deb4a0e5d8414.315fe32.mid.smarden.org> <20070611131112.GA1576@home.power> <20070618134516.GA1560@home.power> <20070619181325.23252.qmail@a92f927aabd53f.315fe32.mid.smarden.org> <20070619190751.GC27090@home.power> <20070620162325.26345.qmail@7d91355cde742c.315fe32.mid.smarden.org> <20070620165736.GC12963@home.power> <20070620183532.4571.qmail@9f638fd8b69905.315fe32.mid.smarden.org> <20070623044205.GA1594@home.power> <20070626095920.6195.qmail@3e147d410b1c2c.315fe32.mid.smarden.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1183792621 21571 80.91.229.12 (7 Jul 2007 07:17:01 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 7 Jul 2007 07:17:01 +0000 (UTC) To: supervision@list.skarnet.org Original-X-From: supervision-return-1706-gcsg-supervision=m.gmane.org@list.skarnet.org Sat Jul 07 09:16:58 2007 connect(): Connection refused Return-path: Envelope-to: gcsg-supervision@gmane.org Original-Received: from antah.skarnet.org ([212.85.147.14]) by lo.gmane.org with smtp (Exim 4.50) id 1I74XK-0008Tx-MA for gcsg-supervision@gmane.org; Sat, 07 Jul 2007 09:16:58 +0200 Original-Received: (qmail 391 invoked by uid 76); 7 Jul 2007 07:17:19 -0000 Mailing-List: contact supervision-help@list.skarnet.org; run by ezmlm List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Archive: Original-Received: (qmail 385 invoked from network); 7 Jul 2007 07:17:19 -0000 Mail-Followup-To: supervision@list.skarnet.org Content-Disposition: inline In-Reply-To: <20070626095920.6195.qmail@3e147d410b1c2c.315fe32.mid.smarden.org> User-Agent: Mutt/1.5.13 (2006-08-11) Xref: news.gmane.org gmane.comp.sysutils.supervision.general:1469 Archived-At: Hi! On Tue, Jun 26, 2007 at 09:59:20AM +0000, Gerrit Pape wrote: > But I may be blind, let's see whether reaping zombies at least every 5 > seconds helps. One difference I already noticed. Every 1 minute cron run several `chpst -L` like this one: */1 * * * * ( cd /var/www/soft.p/html && exec chpst -L .lib/var/.lock.service runsvdir .lib/service/ &>/dev/null ) & and with this patch I notice zombies produced by this command collected with ~5-seconds delay: home ~ # date; ps ax | grep Z Sat Jul 7 10:06:00 EEST 2007 home ~ # date; ps ax | grep Z Sat Jul 7 10:06:01 EEST 2007 2544 ? Z 0:00 [sh] 2545 ? Z 0:00 [sh] 2548 ? Z 0:00 [sh] 2550 ? Z 0:00 [sh] 2552 ? Z 0:00 [sh] 2553 ? Z 0:00 [sh] 2556 ? Z 0:00 [sh] home ~ # date; ps ax | grep Z Sat Jul 7 10:06:06 EEST 2007 2544 ? Z 0:00 [sh] 2545 ? Z 0:00 [sh] 2548 ? Z 0:00 [sh] 2550 ? Z 0:00 [sh] 2552 ? Z 0:00 [sh] 2553 ? Z 0:00 [sh] 2556 ? Z 0:00 [sh] home ~ # date; ps ax | grep Z Sat Jul 7 10:06:08 EEST 2007 This situation repeated every 1 minute, that's why I think it's related to crontab line shown above. I notice this right after reboot, so this can't be related to 'non-reaping zombies issue' discussed in this thread. Looks like it's just behaviour of this patch. > Is anything printed to the console when the zombie problem arises? This problem usually arises on remote servers, so I can't check console... but I'll look at kernel log. > To be sure that runit is the problem, could you boot one of your systems > into sysvinit to see if it has the same problem? No, sorry. I've no configured /etc/inittab and /etc/{init.d,conf.d}/ on my servers, only /etc/runit/{1,2,3}. -- WBR, Alex.