Subject: is this raidframe? (2.0_BETA)
To: None <current-users@netbsd.org>
From: Jeff Rizzo <riz@boogers.sf.ca.us>
List: current-users
Date: 05/08/2004 08:00:44
I got up this morning to find that a 2.0_BETA system (i386, dual PIII,
raidframe with raid1) installed the other day was hung in a way that
the console was unresponsive, but I'm able to get into ddb.
I've included trace info from both processors, and a ps.  Is there
any other information I should gather?  I can leave this system
hung like this for perhaps 24 hours or so, if there's any other information
which would be helpful.

Also, is there anything I can do to make it more likely that I can reboot
the system from this state?  It's not local to me, and it seems to me
that about 50% of the time when I type "reboot" from ddb, I wind up
having to powercycle the machine, which is going to be problematic this
time.  :(

Anyway, here's the trace info:

db{0}> trace 
cpu_Debugger(cd5ce5dc,0,400040,0,fffffffe) at netbsd:cpu_Debugger+0x4
comintr(c2eb6c00,0,cd5c0010,30,cdfd0010) at netbsd:comintr+0x729
Xintr_ioapic_edge4() at netbsd:Xintr_ioapic_edge4+0xa4
--- interrupt ---
checkalias(d3ddc3e0,1207,c3239400,ce1bd370,0) at netbsd:checkalias+0x5c
ufs_vinit(c3239400,c29a7200,c29a7100,cdfdaca8,c363b3e0) at netbsd:ufs_vinit+0x69

ffs_vget(c3239400,1475d96,cdfdad64,4000,400000) at netbsd:ffs_vget+0x274
ufs_lookup(cdfdad94,d49fcba8,cdfdadac,c03bcad1,c05dc8a0) at netbsd:ufs_lookup+0x
6d8
VOP_LOOKUP(d0b7a014,cdfdae84,cdfdae98,6,c05dd020) at netbsd:VOP_LOOKUP+0x2e
lookup(cdfdae74,cf395800,400,cdfdae8c,cdfdae24) at netbsd:lookup+0x221
namei(cdfdae74,807f948,60,0,807fa40) at netbsd:namei+0x120
sys___lstat13(cdfdfbf4,cdfdaf64,cdfdaf5c,118,c0371687) at netbsd:sys___lstat13+0
x58
syscall_plain() at netbsd:syscall_plain+0x17e
--- syscall (number 280) ---
0x480e7547:
db{0}> mach cpu 1
using CPU 1
db{0}> trace
acquire(c07e8a20,cd54deac,400000,0,600) at netbsd:acquire+0x44
spinlock_acquire_count(c07e8a20,1,cd54deec,c01b5077,0) at netbsd:spinlock_acquir
e_count+0x6b
mi_switch(cd52e39c,0,0,0,c2ecc000) at netbsd:mi_switch+0x164
ltsleep(c2ecc0d0,10,c06e28e2,0,c2ecc0d8) at netbsd:ltsleep+0x40e
rf_RaidIOThread(c2ecc000,8b2000,8bb000,0,c010030c) at netbsd:rf_RaidIOThread+0x1
5f
db{0}> ps
 PID           PPID     PGRP        UID S   FLAGS LWPS          COMMAND    WAIT
>761          28135    10102      32767 2  0x4000    1             find
 28135        13499    10102      32767 2  0x4000    1               sh    wait
 13499        20188    10102      32767 2  0x4000    1              csh   pause
 9936         10102    10102         17 2  0x4100    1         sendmail  piperd
 4696         10102    10102          0 2  0x4000    1              tee  piperd
 20188        10102    10102          0 2  0x4000    1               sh    wait
 10102        11665    10102          0 2  0x4000    1               sh    wait
 11665          432      432          0 2       0    1             cron  piperd
 13397         3682    13397       6004 2  0x4002    1             tail  kqread
 22120         5940    22120       6004 2  0x4002    1             tcsh   ttyin
 5940         18475    18475       6004 2   0x100    1             sshd
 18475          343    18475          0 2   0x101    1             sshd   netio
 450           7197     7197      32767 2   0x100    1          distccd  netcon
 449           7197     7197      32767 2   0x100    1          distccd  netcon
 448           7197     7197      32767 2   0x100    1          distccd  netcon
 6992          7197     7197      32767 2   0x100    1          distccd  netcon
 7197             1     7197      32767 2   0x100    1          distccd    wait
 4109             1      330       6004 2  0x4002    1              ssh  select
 4399          4149     4399       6004 2  0x4002    1             tcsh   ttyin
 4149          2723     2723       6004 2   0x100    1             sshd
 2723           343     2723          0 2   0x101    1             sshd   netio
 3682          1361     3682       6004 2  0x4002    1             tcsh   pause
 1361          4374     4374       6004 2   0x100    1             sshd  select
 4374           343     4374          0 2   0x100    1             sshd   netio
 3562          4083     4083          0 2     0x2    1            rsync
 3917          4083     4083          0 2  0x4002    1              ssh
 4083           822     4083          0 2  0x5002    1            rsync
 3194             1     3194          0 2  0x4002    1            getty
 822            760      822          0 2  0x5002    1              csh   pause
 760            756      760       6004 2  0x4002    1              csh   pause
 756            365      365       6004 2   0x100    1             sshd
 365            343      365          0 2   0x101    1             sshd   netio
 445            463      445          0 2  0x5002    1              csh   ttyin
 463            467      463       6004 2  0x4002    1              csh   pause
 467            444      444       6004 2   0x100    1             sshd  select
 444            343      444          0 2     0x1    1             sshd   netio
 432              1      432          0 2       0    1             cron
 414              1      414          0 2       0    1            inetd  kqread
 343              1      343          0 2       0    1             sshd  select
 351              1      351          0 2       0    1             ntpd
 274              1      274          0 2       0    1        mount_mfs  mfsidl
 213              1      213          0 2       0    1          syslogd    poll
 12               0        0          0 2 0x20200    1         aiodoned aiodone
 11               0        0          0 2 0x20200    1          ioflush drainvp
 10               0        0          0 2 0x20200    1       pagedaemon pgdaemo
 9                0        0          0 2 0x20200    1       lfs_writer lfswrit
 8                0        0          0 2 0x20200    1          raidio0
 7                0        0          0 2 0x20200    1            raid0 rfwcond
 6                0        0          0 2 0x20200    1          usbtask  usbtsk
 5                0        0          0 2 0x20200    1             usb0
 4                0        0          0 2 0x20200    1          atabus1   atath
 3                0        0          0 2 0x20200    1          atabus0   atath
 2                0        0          0 2 0x20200    1        cryptoret crypto_
 1                0        1          0 2  0x4000    1             init    wait
 0               -1        0          0 2 0x20200    1          swapper
db{0}> 


-- 
Jeff Rizzo                                         http://boogers.sf.ca.us/~riz