NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

port-alpha/38335: kernel freeze on alpha MP system



>Number:         38335
>Category:       port-alpha
>Synopsis:       kernel freeze on alpha MP system
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    port-alpha-maintainer
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sat Mar 29 22:00:01 +0000 2008
>Originator:     Jarle Greipsland
>Release:        NetBSD -current
>Organization:
        
>Environment:
        
        
System: NetBSD sweetheart.urc.uninett.no 4.99.58 NetBSD 4.99.58 
(GENERIC-$Revision: 1.314 $) #1: Sat Mar 29 20:46:15 CET 2008  
jarle%sweetheart.urc.uninett.no@localhost:/usr/obj/sys/arch/alpha/compile/CS20.MP
 alpha
Architecture: alpha
Machine: alpha
>Description:
A -current alpha kernel on a CS20 dual processor system got stuck in the
kernel during a NetBSD build job.  After more than three minutes without
any response from the system, I managed to break into ddb, and gather some
data:

Stopped in pid 27607.1 (sh) at  netbsd:cpu_Debugger+0x4:        ret     zero,(ra
)
db{0}> trace
cpu_Debugger() at netbsd:cpu_Debugger+0x4
comintr() at netbsd:comintr+0x718
alpha_shared_intr_dispatch() at netbsd:alpha_shared_intr_dispatch+0x5c
sio_iointr() at netbsd:sio_iointr+0x38
interrupt() at netbsd:interrupt+0x1c0
XentInt() at netbsd:XentInt+0x1c
--- interrupt (from ipl 0) ---
rw_vector_exit() at netbsd:rw_vector_exit+0x90
vm_map_unlock_read() at netbsd:vm_map_unlock_read+0x1c
uvm_fault_internal() at netbsd:uvm_fault_internal+0x16c8
trap() at netbsd:trap+0x6c8
XentMM() at netbsd:XentMM+0x20
--- memory management fault ---
memcpy() at netbsd:memcpy+0x54
copyout() at netbsd:copyout+0x50
copyout_vmspace() at netbsd:copyout_vmspace+0x7c
uiomove() at netbsd:uiomove+0xec
ubc_uiomove() at netbsd:ubc_uiomove+0xc8
ffs_read() at netbsd:ffs_read+0x3a8
VOP_READ() at netbsd:VOP_READ+0xcc
vn_rdwr() at netbsd:vn_rdwr+0xd0
vmcmd_readvn() at netbsd:vmcmd_readvn+0x64
execve1() at netbsd:execve1+0x790
syscall_plain() at netbsd:syscall_plain+0x1c4
XentSys() at netbsd:XentSys+0x60
--- syscall (59) ---
--- user mode ---
db{0}> show reg
v0          0xf9
t0          0xfffffc0000686a74  db_fromconsole
t1          0x1
t2          0xff00
t3          0xfffffc00006c0088  cn_magic
t4          0x2
t5          0xa0b
t6          0
t7          0xfffffc00006e11e8  vm_physmem
s0          0xf9
s1          0xc6
s2          0xfffffe00004cd000
s3          0xfffffe00004cd070
s4          0xfffffe00004d5008
s5          0x7fd
s6          0xfffffc00006a6b60  idstate+0x1ec20
a0          0xfffffc00006ea690  tsp_configuration
a1          0xfffffd01fc0003fd
a2          0x5
a3          0xfffffe0013d0b298
a4          0x22
a5          0
t8          0xfffffc00006e11e8  vm_physmem
t9          0xfffffc00004e38ac  bt_hashhead+0x5c
t10         0x600f51dc
t11         0x1a5
ra          0xfffffc00003b1e78  comintr+0x718
t12         0xfffffc00005b7340  cpu_Debugger
at          0xfffffe0013d08000
gp          0xfffffc0000686b60  nfsrtton
sp          0xfffffe0013d0b1c8
pc          0xfffffc00005b7344  cpu_Debugger+0x4
ps          0x6
ai          0x1a5
pv          0xfffffc00005b7340  cpu_Debugger
netbsd:cpu_Debugger+0x4:        ret     zero,(ra)
db{0}> ps
 PID           PPID     PGRP        UID S   FLAGS LWPS          COMMAND    WAIT
>27607        11306    20939          0 2       0    1               sh
 11306         8909    20939          0 2       0    1               sh    wait
 8909          4701    20939          0 2  0x4000    1               sh  piperd
 4701         20692    20939          0 2  0x4000    1               sh    wait
 20692        20939    20939          0 2  0x4000    1          nbgmake    wait
 20939        11274    20939          0 2  0x4000    1               sh    wait
 11274        18129    18129          0 2  0x4000    1           nbmake  select
 18129        22431    18129          0 2  0x4000    1               sh    wait
 22431         5699     5699          0 2  0x4000    1           nbmake  select
 5699          3213     5699          0 2  0x4000    1               sh    wait
 3213          1188     1178          0 2  0x4000    1           nbmake  select
 1084           463     1084       1000 2  0x4000    1              top  select
 1188          1178     1178          0 2  0x4000    1               sh    wait
 1182           389     1182          0 2  0x4000    1             tail  kqueue
 1178           389     1178          0 2       0    1             tcsh   pause
 463            435      463       1000 2  0x4000    1             bash    wait
 435            443      443       1000 2  0x4000    1            xterm  select
 443            437      443       1000 2  0x4000    1              csh   pause
 437             66       66       1000 2   0x100    1             sshd  select
 66             248       66          0 2  0x4100    1             sshd   netio
 389            381      389          0 2  0x4000    1             tcsh   pause
 381            383      381       1000 2  0x4000    1             bash    wait
 383            294      294       1000 2  0x4000    1            xterm  select
 294            301      294       1000 2  0x4000    1              csh   pause
 301            327      327       1000 2   0x100    1             sshd  select
 327            248      327          0 2  0x4100    1             sshd   netio
 377            342      342         12 2  0x4100    1             qmgr  kqueue
 370            342      342         12 2  0x4100    1           pickup  kqueue
 369              1      369          0 2  0x4000    1            getty     tty
 372              1      372          0 2       0    1             cron nanoslp
 355              1      355          0 2       0    1            inetd  kqueue
 342              1      342          0 2  0x4100    1           master  kqueue
 248              1      248          0 2       0    1             sshd  select
 236              1      236          0 2       0    1             ntpd   pause
 156              1      156          0 2       0    1        mount_mfs  mfsidl
 73               1       73          0 2       0    1          syslogd  kqueue
 1                0        1          0 2  0x4001    1             init    wait
 0               -1        0          0 2 0x20002   25           system       *
db{0}> mach cpu 1 
Using CPU 1
db{0}> trace

CPU 0: fatal kernel trap:

CPU 0    trap entry = 0x4 (unaligned access fault)
CPU 0    a0         = 0xfffffc00006e110d
CPU 0    a1         = 0x29
CPU 0    a2         = 0xa
CPU 0    pc         = 0xfffffc00005b7b78
CPU 0    ra         = 0xfffffc0000399044
CPU 0    pv         = 0xfffffc00005b7ac0
CPU 0    curlwp     = 0xfffffc00391713a0
CPU 0        pid = 27607, comm = sh

Caught exception in ddb.
db{0}> show reg
v0          0
t0          0
t1          0
t2          0
t3          0xfffffe0013243880
t4          0
t5          0xfffffc00006ecdc8  kernel_pmap_store
t6          0xfffffc00006ece28  kernel_pmap_store+0x60
t7          0x1
s0          0x1
s1          0xfffffc003f815980
s2          0xfffffc00006e1135  uvm+0xd
s3          0xfffffc00006e1128  uvm
s4          0xfffffe00004a3c00
s5          0x1
s6          0xfffffe001324fdc0
a0          0
a1          0x1
a2          0xfffffe0013bae000
a3          0x1
a4          0
a5          0xfffffe0013bb1dcc
t8          0x80
t9          0x67
t10         0xffffffff
t11         0xfffffc003b1e9328
ra          0xfffffc000049e95c  idle_loop+0x12c
t12         0xfffffc00004cb360  sched_curcpu_runnable_p
at          0x3
gp          0xfffffc0000686b60  nfsrtton
sp          0xfffffc00006e1135  uvm+0xd
pc          0xfffffc000049e934  idle_loop+0x104
ps          0
ai          0xfffffc003b1e9328
pv          0xfffffc00004cb360  sched_curcpu_runnable_p
netbsd:idle_loop+0x104: ldq_u   t0,d(s3)

        
>How-To-Repeat:
Run an MP kernel on an alpha system?
        
>Fix:
        

>Unformatted:
        
        


Home | Main Index | Thread Index | Old Index