From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.sysutils.supervision.general/601 Path: main.gmane.org!not-for-mail From: Charlie Brady Newsgroups: gmane.comp.sysutils.supervision.general Subject: Re: supervising postfix Date: Sat, 16 Oct 2004 16:11:45 -0400 (EDT) Message-ID: References: <7F25AD67-1FA9-11D9-8DD8-000A9598BFB2@annvix.org> NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Trace: sea.gmane.org 1097957521 20752 80.91.229.6 (16 Oct 2004 20:12:01 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sat, 16 Oct 2004 20:12:01 +0000 (UTC) Cc: supervision@list.skarnet.org Original-X-From: supervision-return-840-gcsg-supervision=m.gmane.org@list.skarnet.org Sat Oct 16 22:11:48 2004 Return-path: Original-Received: from antah.skarnet.org ([212.85.147.14] ident=qmailr) by deer.gmane.org with smtp (Exim 3.35 #1 (Debian)) id 1CIuu4-0007Mc-00 for ; Sat, 16 Oct 2004 22:11:48 +0200 Original-Received: (qmail 4958 invoked by uid 76); 16 Oct 2004 20:12:09 -0000 Mailing-List: contact supervision-help@list.skarnet.org; run by ezmlm List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-Archive: Original-Received: (qmail 4952 invoked from network); 16 Oct 2004 20:12:08 -0000 X-X-Sender: charlieb@e-smith.charlieb.ott.istop.com Original-To: Vincent Danen In-Reply-To: <7F25AD67-1FA9-11D9-8DD8-000A9598BFB2@annvix.org> Xref: main.gmane.org gmane.comp.sysutils.supervision.general:601 X-Report-Spam: http://spam.gmane.org/gmane.comp.sysutils.supervision.general:601 On Sat, 16 Oct 2004, Vincent Danen wrote: > >> $daemon_directory/master 2>&1 > >> > >> I can't use exec for master because if I do I get this written to my > >> mail.log: > >> > >> Oct 9 14:31:46 test postfix/master[1941]: fatal: unable to set > >> session > >> and process group ID: Operation not permitted ... > >> I'm really stumped on this one... > > > > You'll either need to ensure that the run script is not a process group > > leader (remove -P from runsvdir, and possibly add "chpst -P" to most > > other > > run scripts), or fix postfix to turn the fatal error into a warning. > > runsvdir doesn't run with -P. I tried using chpst -P on postfix, but > that didn't work. I'm not too terribly interested in changing all the > runscripts to chpst -P every other service (I haven't had the need to > do it for any yet). It's a defensive measure. you can't control when or if a process will kill its own process group. And you don't want any of those processes taking out all your stage 2. You won't have the need for it, until you have the need for it! > Patching postfix is not my idea of a good time, either. I'd prefer to > not mangle as much software as possible because it becomes a > maintenance nuisance. Sure, but you already have a maintenance problem, right now. Postfix doesn't run for you. If you are not using -P anywhere, then maybe you've found a bug with postfix, and it is trying multiple times to become process group leader or something. Have you straced it, so you can see what is being called when? > I think what I may end up doing is calling "postfix start" from stage 2 > if something like /etc/sysconfig/postfix contains "START=yes" or > something similar. Then in stage 3 I'll issue a "postfix stop". Goes > against how I like to do things, but it seems like "master" is doing a > bit of supervision on it's own so instead of using (on Annvix anyways) > "srv stop postfix" one would have to issue "postfix stop". I dislike > that it needs to be different, but at least this way I don't have to > fall back to a traditional initscript. I could then have a runscript > for service postfix that just checks every few seconds to make sure > that master is still running, and if it is, sleep for another 5 seconds > and then do another check. If master doesn't seem to be running, then > just issue "postfix start" and sleep again. > > A bit of a compromise, but I think it might be the best solution. Sounds aweful :-( --- Charlie