Subject: Re: Machine livelock with latest (4.99.48) kernel on sparc64
To: Michael Lorenz <macallan@netbsd.org>
From: Rafal Boni <rafal@pobox.com>
List: current-users
Date: 01/05/2008 12:58:41
Michael Lorenz wrote:
> Hello,
>
> On Jan 5, 2008, at 03:11, Rafal Boni wrote:
>
>> I just rebooted my trusty Netra T1 with a shiny new 4.99.48 kernel and
>> thought I'd kick off a userland build. Things seemed to go swimmingly
>> for a few minutes, then the machine ground to an un-usable state --
>> userland seems to be mostly non-responsive, though the machine is
>> pingable, answers a ^T at a tty (well, it seems to be wedged harder
>> now.. it did for a while after the apparent lockup), and the disk sounds
>> like progress is being made on the build.
>
>> But, I can't get any echo from a tty anymore, and god forbid I should
>> want to log in ;)
>
>> Anyone seeing anything similar? Should I go back to the last-known-good
>> kernel for a while? ;)
>
> Are you using lfs? I see something similar on a dual G4 Mac with OBJ and
> TOOLS on an lfs partition, a userland build reliably triggers it,
> processes hang solid when accessing stuff on lfs, can't be killed and
> the fs can't be unmounted.
No lfs here, just bog-standard ffs w/out softdep. But I did have the
same issue of unkillable processes (killing them from DDB, since I
couldn't get userland response), and when I finally rebooted (again,
from DDB) the system hung unmounting the filesystems as well. Good thing
the LOM can still kick the box and reboot it ;)
--rafal