From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.sysutils.supervision.general/2030 Path: news.gmane.org!not-for-mail From: Charlie Brady Newsgroups: gmane.comp.sysutils.supervision.general Subject: Re: hello - hanging services Date: Tue, 17 Aug 2010 13:38:42 -0400 (EDT) Message-ID: References: <20100817190803.41e8257f.jean.bruenn@ip-minds.de> <20100817192422.a157e85f.jean.bruenn@ip-minds.de> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Trace: dough.gmane.org 1282066723 28267 80.91.229.12 (17 Aug 2010 17:38:43 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Tue, 17 Aug 2010 17:38:43 +0000 (UTC) Cc: supervision@list.skarnet.org To: Jean-Michel Bruenn Original-X-From: supervision-return-2265-gcsg-supervision=m.gmane.org@list.skarnet.org Tue Aug 17 19:38:42 2010 Return-path: Envelope-to: gcsg-supervision@lo.gmane.org Original-Received: from antah.skarnet.org ([212.85.147.14]) by lo.gmane.org with smtp (Exim 4.69) (envelope-from ) id 1OlQ7V-0000lM-V2 for gcsg-supervision@lo.gmane.org; Tue, 17 Aug 2010 19:38:42 +0200 Original-Received: (qmail 6308 invoked by uid 76); 17 Aug 2010 17:40:46 -0000 Mailing-List: contact supervision-help@list.skarnet.org; run by ezmlm List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Archive: Original-Received: (qmail 6274 invoked from network); 17 Aug 2010 17:40:46 -0000 X-X-Sender: charlieb@e-smith.charlieb.ott.istop.com In-Reply-To: <20100817192422.a157e85f.jean.bruenn@ip-minds.de> Xref: news.gmane.org gmane.comp.sysutils.supervision.general:2030 Archived-At: On Tue, 17 Aug 2010, Jean-Michel Bruenn wrote: > Hello, > > thanks for your answer. Wouldn't it be a good improvement for runit, > if it would take care of hanging tasks, also? You gotta detect them first, which is a non-trivial problem (algorithmically impossible in general - http://en.wikipedia.org/wiki/Halting_problem). > There's "run", "finish" > and the "log" stuff - wouldn't it be possible to add "check" as script, check script already exists, but not what you are suggesting: http://manpages.ubuntu.com/manpages/jaunty/man8/sv.8.html > which is running a command all X seconds and if it gets a response it > knows "ah okay, the service is still running" and if it gets no > response "oh, the service seems to have died, let's restart it"? > > Of course, totally optional, up to the user whether to use that or not. > > Difficult to implement? Yes. Please check the archives - this has been discussed previously. > Cheers > > On Tue, 17 Aug 2010 > 13:13:55 -0400 (EDT) Charlie Brady > wrote: > > > > > On Tue, 17 Aug 2010, Jean-Michel Bruenn wrote: > > > > > Hey, > > > > > > i'm curious what happens with hung(hanging?) services (or zombies) is it > > > possible with runit to detect those and restart the service? > > > > hung/hanging services and zombies are different things. A zombie is a > > process which doesn't exist - it's just a process remnant - a status > > report which the kernel is hanging onto waiting for someone to ask for it. > > A hung/hanging services is a real process, but one which isn't doing what > > you want it to do. runit does not detect and restart those. It only > > restarts processes which have died. > > >