supervision - discussion about system services, daemon supervision, init, runlevel management, and tools such as s6 and runit
 help / color / mirror / Atom feed
From: Charlie Brady <charlieb-supervision@budge.apana.org.au>
To: supervision@list.skarnet.org
Subject: runsv and EAGAIN
Date: Fri, 11 Jul 2008 16:46:43 -0400 (EDT)	[thread overview]
Message-ID: <Pine.LNX.4.64.0807111638180.21272@e-smith.charlieb.ott.istop.com> (raw)


I'm just putting together a new sytem using CentOS 5.2 (i686 - 
2.6.18-92.el5PAE), and have runsv chewing up lots of CPU:

...
  1202 root      25   0   108   28   20 R 10.2  0.0   3:13.97 runsv
  1300 root      25   0   112   36   20 R 10.2  0.0   3:17.01 runsv
  1306 root      25   0   108   28   20 R 10.2  0.0   3:18.52 runsv
  1322 root      25   0   112   36   20 R 10.2  0.0   3:16.97 runsv
  1203 root      25   0   108   32   20 R  9.9  0.0   3:19.21 runsv
  1228 root      25   0   112   36   20 R  9.9  0.0   3:21.39 runsv
  1271 root      25   0   112   32   20 R  9.9  0.0   3:24.37 runsv
  1200 root      25   0   112   32   20 R  9.6  0.0   3:23.17 runsv
...

Here's a sample strace:

rt_sigprocmask(SIG_UNBLOCK, [TERM], NULL, 8) = 0
rt_sigprocmask(SIG_UNBLOCK, [CHLD], NULL, 8) = 0
poll([{fd=3, events=POLLIN}, {fd=9, events=POLLIN, revents=POLLIN}, 
{fd=11, events=POLLIN}], 3, 1000020) = 1
rt_sigprocmask(SIG_BLOCK, [TERM], NULL, 8) = 0
rt_sigprocmask(SIG_BLOCK, [CHLD], NULL, 8) = 0
read(3, 0xbfa256c3, 1)                  = -1 EAGAIN (Resource temporarily 
unavailable)
waitpid(-1, 0xbfa256bc, WNOHANG)        = 0
read(9, "", 1)                          = 0
read(11, 0xbfa256c3, 1)                 = -1 EAGAIN (Resource temporarily 
unavailable)
gettimeofday({1215792640, 591768}, NULL) = 0
rt_sigprocmask(SIG_UNBLOCK, [TERM], NULL, 8) = 0
rt_sigprocmask(SIG_UNBLOCK, [CHLD], NULL, 8) = 0
poll([{fd=3, events=POLLIN}, {fd=9, events=POLLIN, revents=POLLIN}, 
{fd=11, events=POLLIN}], 3, 1000020) = 1
rt_sigprocmask(SIG_BLOCK, [TERM], NULL, 8) = 0
rt_sigprocmask(SIG_BLOCK, [CHLD], NULL, 8) = 0
read(3, 0xbfa256c3, 1)                  = -1 EAGAIN (Resource temporarily 
unavailable)
waitpid(-1, 0xbfa256bc, WNOHANG)        = 0
read(9, "", 1)                          = 0
read(11, 0xbfa256c3, 1)                 = -1 EAGAIN (Resource temporarily 
unavailable)
gettimeofday({1215792640, 593313}, NULL) = 0
...

I think this might be a case of a kernel being "this dumb" - iopause.c:

...
  	/* XXX: some kernels apparently need x[0] even if len is 0 */
  	/* XXX: how to handle EAGAIN? are kernels really this dumb? */
  	/* XXX: how to handle EINVAL? when exactly can this happen? */
...


...
ERRORS
        The poll() function shall fail if:

        EAGAIN The allocation of internal data structures failed but a 
subsequent request may succeed.

...

The system isn't obviously out of memory:

[root@amc-upgrade:]# free
              total       used       free     shared    buffers     cached
Mem:        255608     213396      42212          0      29240     109132
-/+ buffers/cache:      75024     180584
Swap:       265064          0     265064
~
[root@amc-upgrade:]#

[root@amc-upgrade:]# cat /proc/meminfo
MemTotal:       255608 kB
MemFree:         47580 kB
Buffers:         29276 kB
Cached:         109176 kB
SwapCached:          0 kB
Active:         112132 kB
Inactive:        75084 kB
HighTotal:           0 kB
HighFree:            0 kB
LowTotal:       255608 kB
LowFree:         47580 kB
SwapTotal:      265064 kB
SwapFree:       265064 kB
Dirty:            2176 kB
Writeback:           0 kB
AnonPages:       48780 kB
Mapped:           9236 kB
Slab:            15600 kB
PageTables:       2372 kB
NFS_Unstable:        0 kB
Bounce:              0 kB
CommitLimit:    392868 kB
Committed_AS:   206484 kB
VmallocTotal:   772088 kB
VmallocUsed:      2284 kB
VmallocChunk:   769652 kB
HugePages_Total:     0
HugePages_Free:      0
HugePages_Rsvd:      0
Hugepagesize:     2048 kB
~
[root@amc-upgrade:]#




             reply	other threads:[~2008-07-11 20:46 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-07-11 20:46 Charlie Brady [this message]
2008-07-12  1:27 ` Charlie Brady
2008-07-12 16:07   ` Dražen Kačar
2008-07-13  2:46     ` Charlie Brady
2009-01-20 23:21       ` runsv spinning 100% CPU (was Re: runsv and EAGAIN) Charlie Brady
2009-02-10 12:41         ` Gerrit Pape
2009-02-18  0:09           ` Laurent Bercot

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Pine.LNX.4.64.0807111638180.21272@e-smith.charlieb.ott.istop.com \
    --to=charlieb-supervision@budge.apana.org.au \
    --cc=supervision@list.skarnet.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).