zsh-workers
From: Phil Pennock <zsh-workers+phil.pennock@spodhuis.org>
To: zsh-workers@sunsite.dk, 482346@bugs.debian.org
Subject: Re: Bug#482346: zsh doesn't always wait for its children (-> zombie)
Date: Sun, 25 May 2008 15:26:58 -0700
Message-ID: <20080525222657.GA80449@redoubt.spodhuis.org>
In-Reply-To: <20080525213719.GV7056@prunille.vinc17.org>

On 2008-05-25 at 23:37 +0200, Vincent Lefevre wrote:
> On 2008-05-24 18:23:21 -0700, Phil Pennock wrote:
> > Then I'd be inclined to start looking into hardware issues, since
> > _something's_ probably getting stuck in disk IO; I'll suspect that
> > before kernel bugs, but it might also be worth seeing if there are other
> > problems with threaded programs on powerpc, if init really can't reap
> > something that has already become a zombie.
> 
> I've looked at /var/log/kern.log and there's something each time
> I interrupted vlc, e.g.
> 
> May 24 14:33:36 ay kernel: Unable to handle kernel paging request for data at address 0x481e7000
> May 24 14:33:36 ay kernel: Faulting instruction address: 0xc00131e8
> May 24 14:33:36 ay kernel: Oops: Kernel access of bad area, sig: 11 [#1]

That's a segfault; the kernel is then oopsing whilst trying to page in
memory to write the coredump. It looks like a problem in the MMU logic
for powerpc.

So, the problems are:

 * vlc is segfaulting when it receives SIGINT;

 * the powerpc Linux kernel has a bug whereby, in some cases, it ends up
   not letting the parent wait on the process (from what I understand of
   the details so far), so it looks like the process isn't actually
   ending and transitioning to zombie status. It might be worth talking
   to the architecture maintainers for your distribution about known
   issues. Note that even init is unable to reclaim these processes.
   Have you tried sending a SIGKILL to force-exit the vlc, to see
   whether either zsh or init can reap the process then?

 * zsh is somehow tickling the kernel bug and it might be worth having
   configure logic to deal with this, even after the problem is fixed,
   once we know what it is that's tickling this.
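
For the SIGKILL experiment, here is a minimal sketch in Python of what
"kill it and see whether the parent can reap it" looks like from the
parent's side. It only covers a direct child of the script, not a child
of zsh, and the sleeping child is a hypothetical stand-in for vlc; the
function names are made up for illustration.

```python
import signal
import subprocess
import sys


def spawn_stand_in():
    # Hypothetical stand-in for vlc: a child that just sleeps.
    return subprocess.Popen(
        [sys.executable, "-c", "import time; time.sleep(60)"]
    )


def kill_and_reap(proc, timeout=5.0):
    """Send SIGKILL to a child, then try to reap it.

    Returns the child's return code (the negative signal number on POSIX)
    if it was reaped, or None if the wait timed out, which is the symptom
    under discussion: the child never becomes reapable.
    """
    proc.send_signal(signal.SIGKILL)
    try:
        return proc.wait(timeout=timeout)
    except subprocess.TimeoutExpired:
        return None


if __name__ == "__main__":
    status = kill_and_reap(spawn_stand_in())
    print("not reaped" if status is None else f"reaped, status {status}")
```

On a healthy kernel the wait should return promptly; a timeout here
would suggest the process is stuck somewhere the signal can't reach it.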

> May 24 14:33:36 ay kernel: note: vlc[21850] exited with preempt_count 1

My nasty suspicious mind thinks that special kernel logic for handling a
weird exit condition, and logging it, is less tested code that's already
doing something different to the default, so this is likely close to the
root cause; no powerpc available for me to test, though.

It seems unlikely that there'd be enough bugs to also have a zombie
contributing to load average, so I suspect that the process has not in
fact exited yet: it's still running, and that's where the load comes
from. Does ps(1) actually show the 'Z' for zombie?
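
If it's easier than eyeballing ps output, the same state letter can be
read from /proc/<pid>/stat on Linux. A rough sketch, with made-up
function names and a made-up sample line; the only facts it relies on
are that the state follows the parenthesised command name, and that
Linux counts runnable (R) and uninterruptible-sleep (D) tasks in the
load average but not zombies (Z).

```python
from pathlib import Path

# States that Linux counts towards the load average.
LOAD_STATES = {"R", "D"}


def parse_state(stat_line):
    """Extract the state letter from a /proc/<pid>/stat line.

    The command name is wrapped in parentheses and may itself contain
    spaces or ')', so split on the last ')' rather than on whitespace.
    """
    after_comm = stat_line.rsplit(")", 1)[1]
    return after_comm.split()[0]


def process_state(pid):
    """Return the state letter for pid (Linux only, reads /proc)."""
    return parse_state(Path(f"/proc/{pid}/stat").read_text())


def describe(state):
    if state == "Z":
        return "zombie: already exited, waiting to be reaped"
    if state in LOAD_STATES:
        return "still alive and counted in the load average"
    return "alive but not contributing to load"


if __name__ == "__main__":
    # Made-up sample line, not taken from the affected machine.
    sample = "1234 (some prog) D 1 1234 1234 0"
    print(describe(parse_state(sample)))
```

To check the real process, call process_state() with the pid of the
stuck vlc. A 'D' or 'R' there would fit the "not actually exited"
theory; a 'Z' would not.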

-Phil

