Subject: port-alpha/13525: scsi related panic on alpha
To: None <gnats-bugs@gnats.netbsd.org>
From: Tim Rightnour <root@polaris.garbled.net>
List: netbsd-bugs
Date: 07/21/2001 08:59:16
>Number:         13525
>Category:       port-alpha
>Synopsis:       scsi related panic on alpha
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    port-alpha-maintainer
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sat Jul 21 08:53:02 PDT 2001
>Closed-Date:
>Last-Modified:
>Originator:     Tim Rightnour
>Release:        NetBSD 1.5V
>Organization:
	
>Environment:
	
NetBSD giauzar 1.5V NetBSD 1.5V (GIAUZAR) #0: Thu May  3 07:47:35 UTC 2001     root@giauzar:/usr/src/cvs/src/sys/arch/alpha/compile/GIAUZAR alpha


>Description:
The machine panicked in the middle of the night, approximately 8-12 hours
after very heavy use of the CDROM (on the same scsibus).  It looks like it
died during the daily run out of cron.  Some of the dmesg, and what little
debugging data I have, are replicated below:

No core is available.  For some unknown reason I have not been able to gather
cores from this machine.

cd0(siop0:0:6:0): command timeout
siop0: scsi bus reset
cd0(siop0:0:6:0): command with tag id 0 reset
cd0: Async, 8-bit transfers
cd0: Sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers
sd2: Sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers
cd0(siop0:0:6:0): command timeout
siop0: scsi bus reset
sd2: Async, 8-bit transfers
cd0(siop0:0:6:0): command with tag id 0 reset
cd0: Async, 8-bit transfers
cd0: Sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers
sd2: Sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers
Warning: received processor correctable error.
Warning: received processor correctable error.
Warning: received processor correctable error.
sd4: Sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers, tagged queueing
sd3: Sync (100.0ns offset 8), 8-bit (10.000MB/s) transfers
siop0: command with invalid status (IRQ code 0xff01 current status 0) !
siop0: unhandled message 0x80
sd2: dk_busy < 0
panic: disk_unbusy
Stopped in pid 3 (siop0:0) at   cpu_Debugger+0x4:       ret     zero,(ra)
db> 
db> trace
cpu_Debugger() at cpu_Debugger+0x4
panic() at panic+0xfc
disk_unbusy() at disk_unbusy+0x58
sddone() at sddone+0x58
scsipi_complete() at scsipi_complete+0x424
scsipi_completion_thread() at scsipi_completion_thread+0xa4
esigcode() at esigcode
--- root of call graph ---
db> 
db> ps
 PID             PPID       PGRP        UID S   FLAGS          COMMAND    WAIT
 23625          23618      23617          0 3  0x4004             find biowait
<cut some data out.. sorry>
db> c
syncing disks... warning: mfs read during shutdown
sd2: dk_busy < 0
panic: disk_unbusy
Stopped in pid 3 (siop0:0) at   cpu_Debugger+0x4:       ret     zero,(ra)
db> 
db> c
dumping to dev 4,1 offset 282111

>How-To-Repeat:
Unknown.  I'm not even sure what happened.
	
>Fix:
Hope so.  ;)
	
>Release-Note:
>Audit-Trail:
>Unformatted: