Subject: Re: netbsd-4 MP kernel panic
To: None <port-sparc@netbsd.org>
From: John D. Baker <jdbaker@mylinuxisp.com>
List: port-sparc
Date: 07/01/2007 12:18:06
This time around, I built and installed kernels built from the latest
netbsd-4 sources (updated late 28 June).  During the subsequent build
of the userland, I got the same panic again.

When I'd recovered from that, I built and installed kernels with
"options LOCKDEBUG".  During the restarts of the userland build, I got
the panic twice more, but with more information.
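
For anyone wanting to reproduce this, LOCKDEBUG is enabled with a single
line in the kernel configuration file.  A minimal sketch follows, assuming
a custom config derived from the stock sparc GENERIC.MP file; the exact
config file used for these kernels is not stated in this message, so the
include line is an assumption.

```
# Hypothetical custom kernel config; the base config is an assumption.
include "arch/sparc/conf/GENERIC.MP"

# Enable extra simple_lock sanity checks and diagnostics.
options LOCKDEBUG
```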

The output is below.  The first line is from the non-LOCKDEBUG kernel; the
subsequent two groups are from the LOCKDEBUG kernel.  The source files
referenced in the messages live on my file server and are reached via NFS.


[...]
xcall(cpu1,0xf00087e4): couldn't ping cpus:panic:  cpu0cpu0: stuck on lock@f0317274


[...]
xcall(cpu1,0xf00087e4): couldn't ping cpus:panic:  cpu0cpu0: stuck on lock@f0329604

syncing disks... 
simple_lock: locking against myself
lock: 0xf0326d24, currently at: /amd/halloran/r0/d2/NetBSD/src/sys/kern/kern_synch.c:1237
on CPU 0 
last locked: /amd/halloran/r0/d2/NetBSD/src/sys/kern/sys_generic.c:1129
last unlocked: /amd/halloran/r0/d2/NetBSD/src/sys/kern/kern_synch.c:744

switching with held simple_lock 0xf035a588 CPU 0 /amd/halloran/r0/d2/NetBSD/src/sys/kern/subr_pool.c:1292

simple_lock: uninitialized lock
lock: 0xf035a588, currently at: /amd/halloran/r0/d2/NetBSD/src/sys/kern/subr_pool.c:935
on CPU 1
last locked: /amd/halloran/r0/d2/NetBSD/src/sys/kern/subr_pool.c:1292
last unlocked: /amd/halloran/r0/d2/NetBSD/src/sys/kern/subr_pool.c:1294


[...]
xcall(cpu0,0xf00087e4): couldn't ping cpus:panic:  cpu1cpu1: stuck on lock@f0329604

syncing disks... 
simple_lock: locking against myself
lock: 0xf0326d24, currently at: /amd/halloran/r0/d2/NetBSD/src/sys/kern/kern_synch.c:1237
on CPU 1 
last locked: /amd/halloran/r0/d2/NetBSD/src/sys/kern/sys_generic.c:1129
last unlocked: /amd/halloran/r0/d2/NetBSD/src/sys/kern/kern_synch.c:744

pool_get(PR_WAITOK) with held simple_lock 0xf5702c68 CPU 1 /amd/halloran/r0/d2/NetBSD/src/sys/kern/tty.c:2487

[ last message repeated 107 times ]
[ system hung ]


Subsequent attempts to finish building userland have failed, but I suspect
the cause is local filesystem corruption left by the prior panics.  The
build is being restarted from scratch.

-- 
John D. Baker, KN5UKS                    NetBSD     Darwin/MacOS X
jdbaker(at)mylinuxisp(dot)com                 OpenBSD            FreeBSD
BSD -- It just sits there and _works_!
GPG fingerprint:  D703 4A7E 479F 63F8 D3F4  BD99 9572 8F23 E4AD 1645