From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.sysutils.supervision.general/1462 Path: news.gmane.org!not-for-mail From: Radek Podgorny Newsgroups: gmane.comp.sysutils.supervision.general Subject: Re: runit not collecting zombies Date: Mon, 02 Jul 2007 14:42:14 +0200 Message-ID: <4688F2A6.5070805@podgorny.cz> References: <20070611131112.GA1576@home.power> <20070618134516.GA1560@home.power> <20070619181325.23252.qmail@a92f927aabd53f.315fe32.mid.smarden.org> <20070619190751.GC27090@home.power> <20070620162325.26345.qmail@7d91355cde742c.315fe32.mid.smarden.org> <20070620165736.GC12963@home.power> <20070620183532.4571.qmail@9f638fd8b69905.315fe32.mid.smarden.org> <46876927.5020108@podgorny.cz> <20070702082801.27191.qmail@b7ca43d472c5fa.315fe32.mid.smarden.org> <4688E02B.70108@podgorny.cz> <20070702121414.13011.qmail@b021801bed4952.315fe32.mid.smarden.org> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1183380140 24054 80.91.229.12 (2 Jul 2007 12:42:20 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 2 Jul 2007 12:42:20 +0000 (UTC) To: supervision@list.skarnet.org Original-X-From: supervision-return-1699-gcsg-supervision=m.gmane.org@list.skarnet.org Mon Jul 02 14:42:18 2007 connect(): Connection refused Return-path: Envelope-to: gcsg-supervision@gmane.org Original-Received: from antah.skarnet.org ([212.85.147.14]) by lo.gmane.org with smtp (Exim 4.50) id 1I5LEO-0001oa-0z for gcsg-supervision@gmane.org; Mon, 02 Jul 2007 14:42:16 +0200 Original-Received: (qmail 7979 invoked by uid 76); 2 Jul 2007 12:42:37 -0000 Mailing-List: contact supervision-help@list.skarnet.org; run by ezmlm List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Archive: Original-Received: (qmail 7969 invoked from network); 2 Jul 2007 12:42:37 -0000 User-Agent: Thunderbird 2.0.0.4 (X11/20070615) In-Reply-To: <20070702121414.13011.qmail@b021801bed4952.315fe32.mid.smarden.org> X-Enigmail-Version: 0.95.1 Xref: news.gmane.org gmane.comp.sysutils.supervision.general:1462 Archived-At: Yeah, the PPID of the zombies is 1 for sure. Actually, I'm not experiencing it with lighttpd (I've just found a similar problem on the web with better explanation in hope it helps). There are basically two types of zombies on my system. Lots of sshd zombies (I don't know where they come from, maybe automated login attempts...) and lots of arp zombies. ARP does not for at all AFAIK. It's there because I have a python script which executes arp and run that script from cron. Radek P. Gerrit Pape wrote: > On Mon, Jul 02, 2007 at 01:23:23PM +0200, Radek Podgorny wrote: >> Well, actually I'm the original poster. Unfortunately I can't do any > > Ups, sorry. > >> The number of processes doesn't have to be "huge" and they don't need to >> be "short lived" either (AFAIK). The parent pid of the zombies is 1. > > When reading your initial post and http://trac.lighttpd.net/trac/ticket/978 > I concluded that this is not a runit problem, otherwise the patch posted > to the link above should not work. As I see it, lighttpd version 1.4.15 > has a problem when run with the -D switch. It doesn't wait() for its > children that are spawned on startup. If run without -D, it detaches > afterwards through a double fork which makes the zombies go, but it > doesn't with -D: > > # lighttpd -D -f /etc/lighttpd/lighttpd.conf & > [1] 10362 > # ps --ppid 10362 > PID TTY TIME CMD > 10363 pts/1 00:00:00 create-mime.ass > 10364 pts/1 00:00:00 include-conf-en > # > > Please check again the parent pid of the zombies through 'ps -ef' and/or > /proc//status. > > Thanks, Gerrit. >