9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
* [9fans] installs which hang
@ 2014-06-07 15:45 erik quanstrom
  2014-06-07 17:16 ` Ramakrishnan Muthukrishnan
  2014-06-07 23:21 ` Steve Simon
  0 siblings, 2 replies; 4+ messages in thread
From: erik quanstrom @ 2014-06-07 15:45 UTC (permalink / raw)
  To: 9fans

i don't think this has been mentioned in a while, so i wanted
to quickly jot down a few tricks for looking at a "hung" machine.

there are three main things that can hang things up on initial boot
that aren't related to the kernel misbehaving
- unclean shutdown forcing a fs check.  this can take 15-20 minutes on a
big drive, or slow vm disk.
- dhcp (should be limited to 5-10 minutes on sources, and ~30s on atom)
- timesync.  i saw this issue one in 2008, so i don't remember much about it.
- interrupts incorrectly mapped.  (even acpi can get it wrong.)

if you can hit enter on the console, and have it recognized, you can
have some confidence that interrupts are working.

if you can type ^T^Tp and get a process listing, that's even better.
you may (especially if you have a serial console) be able to figure out who
is misbehaving.

if you can type ^T^Tq and get the scheduler dump that might tell you
if you have a lot of runnable processes.

if you can type ^T^Ti and get the interrupt dump, that might tell you
if a some hardware isn't interrupting.

if your machine isn't making it this far, there isn't too much that can be
easily done, unless you can pxe boot.  i usually put prints in the boot to
see where things are going wrong.  i had the pleasure of doing that yesterday
putting new locks in the pae kernel.  (i really need to use charles' GS:
extern register trick to avoid this MACHP nonsense.)

- erik



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [9fans] installs which hang
  2014-06-07 15:45 [9fans] installs which hang erik quanstrom
@ 2014-06-07 17:16 ` Ramakrishnan Muthukrishnan
  2014-06-07 23:21 ` Steve Simon
  1 sibling, 0 replies; 4+ messages in thread
From: Ramakrishnan Muthukrishnan @ 2014-06-07 17:16 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

On Sat, Jun 7, 2014 at 9:15 PM, erik quanstrom <quanstro@quanstro.net> wrote:
> i don't think this has been mentioned in a while, so i wanted
> to quickly jot down a few tricks for looking at a "hung" machine.
>
> there are three main things that can hang things up on initial boot
> that aren't related to the kernel misbehaving
> - unclean shutdown forcing a fs check.  this can take 15-20 minutes on a
> big drive, or slow vm disk.
> - dhcp (should be limited to 5-10 minutes on sources, and ~30s on atom)
> - timesync.  i saw this issue one in 2008, so i don't remember much about it.
> - interrupts incorrectly mapped.  (even acpi can get it wrong.)
>
> if you can hit enter on the console, and have it recognized, you can
> have some confidence that interrupts are working.
>
> if you can type ^T^Tp and get a process listing, that's even better.
> you may (especially if you have a serial console) be able to figure out who
> is misbehaving.
>
> if you can type ^T^Tq and get the scheduler dump that might tell you
> if you have a lot of runnable processes.
>
> if you can type ^T^Ti and get the interrupt dump, that might tell you
> if a some hardware isn't interrupting.
>
> if your machine isn't making it this far, there isn't too much that can be
> easily done, unless you can pxe boot.  i usually put prints in the boot to
> see where things are going wrong.  i had the pleasure of doing that yesterday
> putting new locks in the pae kernel.  (i really need to use charles' GS:
> extern register trick to avoid this MACHP nonsense.)

Thank you. This is very useful.

With the 9atom install on virtualbox which hangs at init: starting
/bin/rc, I can see the process listing with ^T^Tp. The scheduler
listing just show "nrdy 0". ^T^Ti does not show anything. Keyboard is
active.

--
  Ramakrishnan



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [9fans] installs which hang
  2014-06-07 15:45 [9fans] installs which hang erik quanstrom
  2014-06-07 17:16 ` Ramakrishnan Muthukrishnan
@ 2014-06-07 23:21 ` Steve Simon
  2014-06-08  8:05   ` erik quanstrom
  1 sibling, 1 reply; 4+ messages in thread
From: Steve Simon @ 2014-06-07 23:21 UTC (permalink / raw)
  To: 9fans

> - timesync.  i saw this issue one in 2008, so i don't remember much about it.

I think this was a bug in cron. When the time lept forward as timesync corrected
the time at boot cron would try to run all the intervening events and hang the machine.

cron now ignores time changes if they are big.

Having said this my memory is a little hazy too...

-Steve



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [9fans] installs which hang
  2014-06-07 23:21 ` Steve Simon
@ 2014-06-08  8:05   ` erik quanstrom
  0 siblings, 0 replies; 4+ messages in thread
From: erik quanstrom @ 2014-06-08  8:05 UTC (permalink / raw)
  To: 9fans

On Sat Jun  7 19:22:41 EDT 2014, steve@quintile.net wrote:
> > - timesync.  i saw this issue one in 2008, so i don't remember much about it.
>
> I think this was a bug in cron. When the time lept forward as timesync corrected
> the time at boot cron would try to run all the intervening events and hang the machine.
>
> cron now ignores time changes if they are big.
>
> Having said this my memory is a little hazy too...

it can also lock up the machine updating "too fast".  without help
from cron.  i just don't remember the details of why this resulted
in a hang.

- erik



^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2014-06-08  8:05 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-06-07 15:45 [9fans] installs which hang erik quanstrom
2014-06-07 17:16 ` Ramakrishnan Muthukrishnan
2014-06-07 23:21 ` Steve Simon
2014-06-08  8:05   ` erik quanstrom

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).