NetBSD-Bugs archive
port-alpha/38335: kernel freeze on alpha MP system
>Number: 38335
>Category: port-alpha
>Synopsis: kernel freeze on alpha MP system
>Confidential: no
>Severity: serious
>Priority: medium
>Responsible: port-alpha-maintainer
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Sat Mar 29 22:00:01 +0000 2008
>Originator: Jarle Greipsland
>Release: NetBSD -current
>Organization:
>Environment:
System: NetBSD sweetheart.urc.uninett.no 4.99.58 NetBSD 4.99.58
(GENERIC-$Revision: 1.314 $) #1: Sat Mar 29 20:46:15 CET 2008
jarle%sweetheart.urc.uninett.no@localhost:/usr/obj/sys/arch/alpha/compile/CS20.MP
alpha
Architecture: alpha
Machine: alpha
>Description:
A -current alpha kernel on a CS20 dual-processor system froze in the
kernel during a NetBSD build job. After more than three minutes without
any response from the system, I managed to break into ddb and gather some
data:
Stopped in pid 27607.1 (sh) at netbsd:cpu_Debugger+0x4: ret zero,(ra)
db{0}> trace
cpu_Debugger() at netbsd:cpu_Debugger+0x4
comintr() at netbsd:comintr+0x718
alpha_shared_intr_dispatch() at netbsd:alpha_shared_intr_dispatch+0x5c
sio_iointr() at netbsd:sio_iointr+0x38
interrupt() at netbsd:interrupt+0x1c0
XentInt() at netbsd:XentInt+0x1c
--- interrupt (from ipl 0) ---
rw_vector_exit() at netbsd:rw_vector_exit+0x90
vm_map_unlock_read() at netbsd:vm_map_unlock_read+0x1c
uvm_fault_internal() at netbsd:uvm_fault_internal+0x16c8
trap() at netbsd:trap+0x6c8
XentMM() at netbsd:XentMM+0x20
--- memory management fault ---
memcpy() at netbsd:memcpy+0x54
copyout() at netbsd:copyout+0x50
copyout_vmspace() at netbsd:copyout_vmspace+0x7c
uiomove() at netbsd:uiomove+0xec
ubc_uiomove() at netbsd:ubc_uiomove+0xc8
ffs_read() at netbsd:ffs_read+0x3a8
VOP_READ() at netbsd:VOP_READ+0xcc
vn_rdwr() at netbsd:vn_rdwr+0xd0
vmcmd_readvn() at netbsd:vmcmd_readvn+0x64
execve1() at netbsd:execve1+0x790
syscall_plain() at netbsd:syscall_plain+0x1c4
XentSys() at netbsd:XentSys+0x60
--- syscall (59) ---
--- user mode ---
db{0}> show reg
v0 0xf9
t0 0xfffffc0000686a74 db_fromconsole
t1 0x1
t2 0xff00
t3 0xfffffc00006c0088 cn_magic
t4 0x2
t5 0xa0b
t6 0
t7 0xfffffc00006e11e8 vm_physmem
s0 0xf9
s1 0xc6
s2 0xfffffe00004cd000
s3 0xfffffe00004cd070
s4 0xfffffe00004d5008
s5 0x7fd
s6 0xfffffc00006a6b60 idstate+0x1ec20
a0 0xfffffc00006ea690 tsp_configuration
a1 0xfffffd01fc0003fd
a2 0x5
a3 0xfffffe0013d0b298
a4 0x22
a5 0
t8 0xfffffc00006e11e8 vm_physmem
t9 0xfffffc00004e38ac bt_hashhead+0x5c
t10 0x600f51dc
t11 0x1a5
ra 0xfffffc00003b1e78 comintr+0x718
t12 0xfffffc00005b7340 cpu_Debugger
at 0xfffffe0013d08000
gp 0xfffffc0000686b60 nfsrtton
sp 0xfffffe0013d0b1c8
pc 0xfffffc00005b7344 cpu_Debugger+0x4
ps 0x6
ai 0x1a5
pv 0xfffffc00005b7340 cpu_Debugger
netbsd:cpu_Debugger+0x4: ret zero,(ra)
db{0}> ps
PID PPID PGRP UID S FLAGS LWPS COMMAND WAIT
>27607 11306 20939 0 2 0 1 sh
11306 8909 20939 0 2 0 1 sh wait
8909 4701 20939 0 2 0x4000 1 sh piperd
4701 20692 20939 0 2 0x4000 1 sh wait
20692 20939 20939 0 2 0x4000 1 nbgmake wait
20939 11274 20939 0 2 0x4000 1 sh wait
11274 18129 18129 0 2 0x4000 1 nbmake select
18129 22431 18129 0 2 0x4000 1 sh wait
22431 5699 5699 0 2 0x4000 1 nbmake select
5699 3213 5699 0 2 0x4000 1 sh wait
3213 1188 1178 0 2 0x4000 1 nbmake select
1084 463 1084 1000 2 0x4000 1 top select
1188 1178 1178 0 2 0x4000 1 sh wait
1182 389 1182 0 2 0x4000 1 tail kqueue
1178 389 1178 0 2 0 1 tcsh pause
463 435 463 1000 2 0x4000 1 bash wait
435 443 443 1000 2 0x4000 1 xterm select
443 437 443 1000 2 0x4000 1 csh pause
437 66 66 1000 2 0x100 1 sshd select
66 248 66 0 2 0x4100 1 sshd netio
389 381 389 0 2 0x4000 1 tcsh pause
381 383 381 1000 2 0x4000 1 bash wait
383 294 294 1000 2 0x4000 1 xterm select
294 301 294 1000 2 0x4000 1 csh pause
301 327 327 1000 2 0x100 1 sshd select
327 248 327 0 2 0x4100 1 sshd netio
377 342 342 12 2 0x4100 1 qmgr kqueue
370 342 342 12 2 0x4100 1 pickup kqueue
369 1 369 0 2 0x4000 1 getty tty
372 1 372 0 2 0 1 cron nanoslp
355 1 355 0 2 0 1 inetd kqueue
342 1 342 0 2 0x4100 1 master kqueue
248 1 248 0 2 0 1 sshd select
236 1 236 0 2 0 1 ntpd pause
156 1 156 0 2 0 1 mount_mfs mfsidl
73 1 73 0 2 0 1 syslogd kqueue
1 0 1 0 2 0x4001 1 init wait
0 -1 0 0 2 0x20002 25 system *
db{0}> mach cpu 1
Using CPU 1
db{0}> trace
CPU 0: fatal kernel trap:
CPU 0 trap entry = 0x4 (unaligned access fault)
CPU 0 a0 = 0xfffffc00006e110d
CPU 0 a1 = 0x29
CPU 0 a2 = 0xa
CPU 0 pc = 0xfffffc00005b7b78
CPU 0 ra = 0xfffffc0000399044
CPU 0 pv = 0xfffffc00005b7ac0
CPU 0 curlwp = 0xfffffc00391713a0
CPU 0 pid = 27607, comm = sh
Caught exception in ddb.
db{0}> show reg
v0 0
t0 0
t1 0
t2 0
t3 0xfffffe0013243880
t4 0
t5 0xfffffc00006ecdc8 kernel_pmap_store
t6 0xfffffc00006ece28 kernel_pmap_store+0x60
t7 0x1
s0 0x1
s1 0xfffffc003f815980
s2 0xfffffc00006e1135 uvm+0xd
s3 0xfffffc00006e1128 uvm
s4 0xfffffe00004a3c00
s5 0x1
s6 0xfffffe001324fdc0
a0 0
a1 0x1
a2 0xfffffe0013bae000
a3 0x1
a4 0
a5 0xfffffe0013bb1dcc
t8 0x80
t9 0x67
t10 0xffffffff
t11 0xfffffc003b1e9328
ra 0xfffffc000049e95c idle_loop+0x12c
t12 0xfffffc00004cb360 sched_curcpu_runnable_p
at 0x3
gp 0xfffffc0000686b60 nfsrtton
sp 0xfffffc00006e1135 uvm+0xd
pc 0xfffffc000049e934 idle_loop+0x104
ps 0
ai 0xfffffc003b1e9328
pv 0xfffffc00004cb360 sched_curcpu_runnable_p
netbsd:idle_loop+0x104: ldq_u t0,d(s3)
>How-To-Repeat:
Run an MP kernel on an alpha system?
>Fix:
>Unformatted: