From: "Radek Podgorny"
Newsgroups: gmane.comp.sysutils.supervision.general
Subject: Re: runit not collecting zombies
Date: Wed, 12 Sep 2007 18:10:10 +0200 (CEST)
Message-ID: <8609.19873598522$1189613438@news.gmane.org>
References: <20070715223553.GU3925@run.galis.org> <20070716000927.GY23517@home.power> <47939.::ffff:77.75.72.5.1189601606.squirrel@mail.podgorny.cz> <20070912143557.GC12043@home.power> <20070912150047.GD12043@home.power>
To: supervision@list.skarnet.org

> On Wed, 12 Sep 2007, Alex Efros wrote:
>
>> On Wed, Sep 12, 2007 at 10:55:18AM -0400, Charlie Brady wrote:
>>>> ... because there different software which also
>>>> produce unreaped zombies (like ssh).
>>> You keep saying that, but I continue to doubt it.
>>> If you can document that that occurs, I'm sure that the ssh maintainers
>>> will want to fix the bug.
>>
>> Are you listen to me? My solution with cron is just INCREASE PROBABILITY,
>> nothing more.
>
> Sure. And if you decrease the probability to zero, you don't have a
> problem any more.

The problem is you can't push it to zero. :-(

Imagine the system as a car, with zombies as the accidents and runit (or init
in general, which reaps zombies) as the seatbelts and airbags. You can be the
best driver in the world, but still, would you buy a car without seatbelts
and airbags? ;-) Accidents shouldn't happen (we have rules, right?) but
actually, they do... :-(

>
>> What about Radek Podgorny - I think he doesn't use cron to
>> start runsvdir, and he has issue with ssh..?
>
> I don't know the details of his problem.
>
>> And what does mean 'I continue to doubt' - you think we're lying to
>> you?!
>
> No, I just haven't seen any evidence. I suspect you are misinterpreting
> the misbehaviour of some program started from ssh, and attributing that
> program's failures to ssh. ssh is always used to start other programs, and
> other programs can always generate zombies. There's nothing ssh can do to
> prevent a child program of it from creating zombies. If ssh is at fault,
> details would be useful, because then someone can find the fault in ssh
> and fix it. Until someone provides evidence that ssh is creating zombies,
> then it's quite reasonable for me to assume that it isn't doing so.
>
> I remain convinced that your problem can be fixed by using runsvdir and
> runsv as they are designed to be used. We can advise you how to do that.
> But if you'd prefer to do strange things with cron, and continue to have
> problems, and point fingers at runit/glibc/kernel, then you have a free
> choice to do that.
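
To make the "seatbelt" concrete: what one expects from an init process here is roughly a non-blocking wait loop that collects every child that has already exited, whoever originally started it. Below is a minimal sketch in Python, purely for illustration. It is not runit's code (runit is written in C), and the function names `spawn_short_lived_children` and `reap_zombies` are made up for this example. It only assumes a POSIX system with `fork` and `waitpid`.

```python
import os
import time


def spawn_short_lived_children(count):
    """Fork `count` children that exit at once.

    Until the parent waits for them, they sit in the process table as zombies.
    """
    pids = []
    for _ in range(count):
        pid = os.fork()
        if pid == 0:
            os._exit(0)  # child: exit immediately, skipping cleanup handlers
        pids.append(pid)
    return pids


def reap_zombies():
    """Collect every child that has already exited, without blocking.

    Returns the list of reaped PIDs (possibly empty).
    """
    reaped = []
    while True:
        try:
            pid, _status = os.waitpid(-1, os.WNOHANG)
        except ChildProcessError:
            break  # this process has no children left at all
        if pid == 0:
            break  # children exist, but none of them has exited yet
        reaped.append(pid)
    return reaped


if __name__ == "__main__":
    children = spawn_short_lived_children(3)
    time.sleep(0.5)  # give the children time to exit and become zombies
    print("spawned:", sorted(children))
    print("reaped: ", sorted(reap_zombies()))
```

A real init would typically run such a loop whenever it receives SIGCHLD rather than after a fixed sleep; the sleep is here only to keep the sketch short.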