Port-sparc64 archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
RC3 LOCKDEBUG panic on 6-way E3k.
I got an RC3 DIAGNOSTICS+DEBUG+LOCKDEBUG kernel running with changes
suggested by Martin. Foor good measure, I eliminated RAIDframe from the
picture by booting a second install from a different disk and started a
build.sh -j8 release-build on it to kill it. Rather than deadlock hard
within 10 minutes, it now survived 37 minutes -- but then it panicked! But
now I have ddb!
(I also noticed in the extra debug output during boot that something set
ddb.onpanic=0, so if this is done in normal kernels too it would explain
why I never got into ddb)
The box is an Enterprise3000 with 6xUltraSPARC-II 400MHz and 4GB RAM and
a dead TODC battery. %-)
Dunno where to go from here, but here's the backtrace, regs and ps
output:
---8<---8<---8<---8<---
panic: kernel diagnostic assertion
"pmap_ctx(LIST_FIRST(&curcpu()->ci_pmap_ctxlist)) != 0" failed: file
"/usr/src/sys/arch/sparc64/sparc64/pmap.c", line 3159
Stopped in pid 26459.1 (sh) at netbsd:cpu_Debugger+0x4: nop
db{2}> bt
__kernassert(1682260, 17071e0, c57, 1707b48, 116e7c88, 1) at
netbsd:__kernassert+0x2c
pmap_activate_pmap(11a5b980, 0, 3, 6, 10848000, 10848000) at
netbsd:pmap_activate_pmap+0x310
lwp_startup(e0018000, 11612440, 0, 0, f587d40, f5a3bb0) at
netbsd:lwp_startup+0x64
lwp_trampoline(0, badcafe, badcafe, badcafe, badcafe, badcafe) at
netbsd:lwp_trampoline
db{2}> show registers
tstate 0x82000606
pc 0x1480f44 cpu_Debugger+0x4
npc 0x1480f48 cpu_Debugger+0x8
ipl 0
y 0x7c5c
g0 0
g1 0x1
g2 0x3a34
g3 0xe0006000
g4 0x18e6c00 ld_prime+0xec40
g5 0
g6 0x31
g7 0x1162b8
o0 0x1706e00 copyright+0x85d90
o1 0x116e7c68
o2 0
o3 0x3f165
o4 0x108580b8
o5 0
o6 0x116e7311
o7 0x13232a8 panic+0x208
l0 0x13aa8d8000
l1 0
l2 0x18d5c0000
l3 0x155140000
l4 0
l5 0
l6 0x18d1c0000
l7 0
i0 0
i1 0
i2 0x64bab3000
i3 0x11202e4000
i4 0x1161244000
i5 0xf587c4000
i6 0x108a800000
i7 0
---8<--- snip all-zero fp regs ---8<---
db{2}> ps
PID LID S CPU FLAGS STRUCT LWP * NAME WAIT
26459> 1 7 2 4 11612440 sh
4680 > 1 7 4 4 11c84ba0 cat
26156 1 3 5 84 13341400 sleep nanoslp
26245 1 3 5 84 13341bc0 sh wait
27637 1 3 4 84 f5afbe0 sh wait
5609 1 3 5 84 11c847c0 nbgmake wait
2985 1 3 4 84 11f30040 sh wait
542 1 3 1 84 1264b360 nbmake select
10811 1 3 5 84 1156ebe0 sh wait
24088 1 3 1 84 1264aba0 nbmake select
1633 1 3 4 84 1187dba0 sh wait
2600 1 3 3 84 11a6ec60 nbmake select
439 1 3 1 84 1156fb60 sh wait
34 1 3 4 84 1156efc0 sh wait
472 1 3 4 84 116137a0 ksh pause
460 1 3 5 84 11612fe0 ksh pause
466 1 3 3 84 11613b80 qmgr kqueue
459 1 3 2 84 1156f780 pickup kqueue
465 1 3 1 84 f5ad020 login wait
451 1 3 0 84 1156e800 cron nanoslp
452 1 3 2 84 1133e020 inetd kqueue
435 1 3 2 84 1156e420 master kqueue
262 1 3 1 84 11612820 sshd select
110 1 3 0 84 1133ebc0 syslogd kqueue
84 1 3 3 84 1133fb40 dhclient select
1 1 3 1 84 f5acc40 init wait
0 61 3 5 204 11208f80 physiod physiod
60 3 5 204 11209b20 vmem_rehash
vmem_rehash
59 3 0 204 f5ae0c0 aiodoned aiodoned
58 3 0 204 f5ae4a0 ioflush syncer
57 3 4 204 f5ae880 pgdaemon pgdaemon
56 3 2 204 f5aec60 raidio0 raidiow
55 3 1 204 f5af040 raid0 rfwcond
54 3 5 204 f5ad7e0 cryptoret
crypto_wait
53 3 2 204 f5ac480 unpgc unpgc
52 3 4 204 f5af420 scsibus2 sccomp
51 3 5 204 f5af800 scsibus1 sccomp
42 3 1 204 f5adbc0 scsibus0 sccomp
41 3 5 204 f5a8080 xcall/5 xcall
40 1 5 204 f5a8460 softser/5
39 1 5 204 f5a8840 softclk/5
38 1 5 204 f5a8c20 softbio/5
37 1 5 204 f5a9000 softnet/5
> 36 7 5 205 f5a93e0 idle/5
35 3 4 204 f5a97c0 xcall/4 xcall
34 1 4 204 f5a9ba0 softser/4
33 1 4 204 f5a4060 softclk/4
32 1 4 204 f5a4440 softbio/4
31 1 4 204 f5a4820 softnet/4
30 1 4 205 f5a4c00 idle/4
29 3 3 204 f5a4fe0 xcall/3 xcall
28 1 3 204 f5a53c0 softser/3
27 1 3 204 f5a57a0 softclk/3
26 1 3 204 f5a5b80 softbio/3
25 1 3 204 f5a2040 softnet/3
> 24 7 3 205 f5a2420 idle/3
23 3 2 204 f5a2800 xcall/2 xcall
22 1 2 204 f5a2be0 softser/2
21 1 2 204 f5a2fc0 softclk/2
20 1 2 204 f5a33a0 softbio/2
19 1 2 204 f5a3780 softnet/2
18 1 2 205 f5a3b60 idle/2
17 3 1 204 f594020 xcall/1 xcall
16 1 1 204 f594400 softser/1
15 1 1 204 f5947e0 softclk/1
14 1 1 204 f594bc0 softbio/1
13 1 1 204 f594fa0 softnet/1
> 12 7 1 205 f595380 idle/1
11 3 0 204 f595760 pmfevent pmfevent
10 3 1 204 f595b40 nfssilly nfssilly
9 3 0 204 f58e000 cachegc cachegc
8 3 4 204 f58e3e0 vrele vrele
7 3 0 204 f58e7c0 xcall/0 xcall
6 1 0 204 f58eba0 softser/0
5 1 0 204 f58ef80 softclk/0
4 1 0 204 f58f360 softbio/0
3 1 0 204 f58f740 softnet/0
> 2 7 0 205 f58fb20 idle/0
1 3 0 204 181bdc0 swapper schedule
---8<---8<---8<---
The machine is now running off of a plain ffs on sd2a, no wapbl active
or anything, and the RAIDframe RAID1-set on sd0+sd1 that is supposed to be
/ is inactive and unmounted (by booting sd2 with boot -a and forcing /
onto sd2a).
This box is hooked up with a serial console to a personal sparc64/4.0
box of mine, so if any sparc64-developer would like I can provide an ssh
account with console access and give you free hands with it. I can't
power-cycle it with short notice if it hangs hard, but now it seems able
to drop into ddb with a LOCKDEBUG kernel.
Best regards,
ali:)
Home |
Main Index |
Thread Index |
Old Index