NetBSD-Bugs archive


Re: kern/41417



The following reply was made to PR kern/41417; it has been noted by GNATS.

From: "Aaron J. Grier" <aaronx.j.grier%intel.com@localhost>
To: gnats-bugs%netbsd.org@localhost
Cc: 
Subject: Re: kern/41417
Date: Thu, 16 Sep 2010 10:55:02 -0700

 seeing the same thing on a hyperthreaded dual xeon (kernel sees four
 cpus) running 5.1_RC3:
 
 NetBSD northwood 5.1_RC3 NetBSD 5.1_RC3 (GENERIC) #1: Mon Jul 26 16:22:59 PDT 2010 ajgrier@ajgrierdev:/home/ajgrier/obj/i386/home/ajgrier/NetBSD/5/sys/arch/i386/compile/GENERIC i386
 
  PID    LADDR WCHAN  TTY      STAT    TIME COMMAND
    1 cc45b860 wait   ?        DWs  0:00.00 init 
  127 cd89d540 -      ?        RWs  0:00.00 /usr/sbin/syslogd -s 
  162 cdd565c0 select ?        DWs  0:00.00 (screen-4.0.3)
  241 cd89d2c0 pause  ?        DWs  0:00.00 /usr/sbin/ntpd 
  262 cc45a5c0 select ?        DWs  0:00.00 /usr/sbin/sshd 
  380 cdbfcd00 kqueue ?        DW   0:00.00 (qmgr)
  391 cdbf37e0 kqueue ?        DWs  0:00.00 /usr/libexec/postfix/master 
  413 cdbfc800 kqueue ?        DWs  0:00.00 /usr/sbin/inetd -l 
  435 cdbfc300 tstile ?        DWs  0:00.00 /usr/libexec/getty Pc ttyE2 (cron)
  452 cdd56ac0 select ?        DW   0:00.00 (sshd)
  584 cdcef320 netio  ?        DWs  0:00.00 (sshd)
 2190 cd89d040 select ?        DW   0:00.00 (sshd)
 2408 cd894ca0 tstile ?        DW   0:00.00 (master)
 2988 cdbfca80 netio  ?        DWs  0:00.00 (sshd)
    0        0 -      console- ?W<  0:00.00 ()
  598 cdcefaa0 tstile ttyp0    DWs  0:00.00 (ksh)
  411 cdd56340 pause  ttyp1    DWs  0:00.00 (ksh)
  589 cdd560c0 tstile ttyp1    DW   0:00.00 (sh)
 1490 cdbf3060 tstile ttyp1    DW   0:00.00 (dumper)
 1922 cd89da40 tstile ttyp1    DW   0:00.00 (dumper)
 2053 cde6c5e0 tstile ttyp1    DW   0:00.00 (taper)
 2086 cdbfc580 tstile ttyp1    DW   0:00.00 (dumper)
 2349 cdf3ba00 tstile ttyp1    DW   0:00.00 (planner)
 2371 cdf3b780 tstile ttyp1    DW   0:00.00 (dumper)
 2383 cde6cae0 -      ttyp1    TW   0:00.00 (sh)
 3408 cde6c360 -      ttyp1    TW   0:00.00 (driver)
  622 cde6cd60 tstile ttyp2    DWs  0:00.00 (ksh)
 1633 cdf3b500 pause  ttyp3    DW   0:00.00 (screen-4.0.3)
 2003 cde6c860 pause  ttyp3    DWs  0:00.00 (ksh)
 3146 cdbf32e0 pause  ttyp4    DWs  0:00.00 (ksh)
 3209 cdcef0a0 tstile ttyp4    DW   0:00.00 (ksh)
  504 cd7ec280 tstile ttyE0    DWs  0:00.00 (getty)
  306 cd7ec000 ttyraw ttyE1    DWs  0:00.00 /usr/libexec/getty Pc ttyE3 
  293 cdcef820 ttyraw ttyE2    DWs  0:00.00 pickup -l -t fifo -u (getty)
  427 cdcef5a0 ttyraw ttyE3    DWs  0:00.00 qmgr -l -t unix -u (getty)
 
 all of them stuck past VOP_LOCK, as an example:
 
 #0  0xc04ab3c8 in mi_switch ()
 #1  0xc04a810b in sleepq_block ()
 #2  0xc04b4635 in turnstile_block ()
 #3  0xc04a4bfe in rw_vector_enter ()
 #4  0xc04fc89e in vlockmgr ()
 #5  0xc0410bb1 in ffs_lock ()
 #6  0xc050b7a0 in VOP_LOCK ()
 #7  0xc0505e68 in vn_lock ()
 #8  0xc04fa25b in namei ()
 #9  0xc0501687 in do_sys_stat ()
 #10 0xc050173c in sys___stat30 ()
 #11 0xc058346d in syscall ()
 #12 0xc0100525 in syscall1 ()
 
 running backups triggers it for me.  I have the crash dump and could dig a bit
 more if desired.
 
 the workaround (disable log) works, but obviously WAPBL is nice for
 faster reboots.
 
 -- 
   Aaron J. Grier | "Not speaking on behalf of Intel" | aaronx.j.grier%intel.com@localhost
 

