Subject: Re: port-sparc/13298: sparc esp driver leaves processes stuck forever after a time out
To: NetBSD GNATS submissions and followups <gnats-bugs@gnats.netbsd.org>
From: Greg A. Woods <woods@weird.com>
List: netbsd-bugs
Date: 06/24/2001 22:46:14
[ On Sunday, June 24, 2001 at 14:54:36 (-0400), Greg A. Woods wrote: ]
> Subject: port-sparc/13298: sparc esp driver leaves processes stuck forever after a time out
> 
> Jun 24 12:35:50 sometimes /netbsd: sd1(esp0:0:1:0): esp0: timed out [ecb 0xf09494e0 (flags 0x1, dleft 800, stat 0)], <state 1, nexus 0x0, phase(l 10, c 100, p 3), resid 2000, msg(q 0,o 0) >
 
Well it happened again, but with a bit more info this time:

sd1: waiting for pack to spin up...
sd1(esp0:0:1:0): esp0: timed out [ecb 0xf0852c08 (flags 0x1, dleft 2000, stat 0)], <state 1, nexus 0x0, phase(l 10, c 100, p 3), resid 0, msg(q 0,o 0) >
sd1(esp0:0:1:0): esp0: timed out [ecb 0xf0852ab8 (flags 0x1, dleft 2000, stat 0)], <state 1, nexus 0x0, phase(l 10, c 100, p 3), resid 0, msg(q 0,o 0) >

This time the triggering process was stuck in 'getblk':

    0 24532 24531  24  -5   0   192  192 getblk   D    p0 0:00.03 make -m /usr/


Hmmmm... I thought... if the disk takes itself offline momentarily then
that smells a lot like the disk is doing something bad, like maybe
encountering an error that 'esp's not reporting properly....

Well it turns out that ARRE wasn't enabled on the disk (damn I wish this
would happen automatically if the driver's not going to do reassignment!)
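
For anyone wanting to check the same setting: as far as I recall from the
SCSI-2 spec, ARRE (Automatic Read Reallocation Enabled) is bit 6 of byte 2
in the Read-Write Error Recovery mode page (page 0x01), with AWRE in bit 7.
A minimal sketch of the bit handling follows, assuming that layout. The
helper names and the sample byte values are hypothetical and are not taken
from any real tool or from this disk.

```python
# Assumed layout (SCSI-2 Read-Write Error Recovery page, byte 2):
#   bit 7 = AWRE, bit 6 = ARRE, lower bits = TB/RC/EER/PER/DTE/DCR
AWRE = 0x80
ARRE = 0x40


def arre_enabled(flags_byte):
    """Return True if the ARRE bit is set in byte 2 of the mode page."""
    return bool(flags_byte & ARRE)


def with_arre(flags_byte):
    """Return the byte with ARRE set, leaving every other bit untouched."""
    return flags_byte | ARRE


# Hypothetical example: a drive that has AWRE on but ARRE off.
before = 0x80
after = with_arre(before)
print(hex(before), arre_enabled(before))
print(hex(after), arre_enabled(after))
```

The new byte would then be written back with a MODE SELECT, which is what
the mode-page editor does on your behalf.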

Using the old FreeBSD "scsi" tool (which still works just fine for
mode-page adjustments on NetBSD, including on sparc), I turned it on and
did a 'dd if=/dev/rsd1c of=/dev/null' with nary a problem.  The
performance wasn't very stunning though:

	4197405+0 records in
	4197405+0 records out
	2149071360 bytes transferred in 4885.573 secs (439881 bytes/sec)
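
As a sanity check on those numbers, here is a small sketch that recomputes
the totals. It assumes dd was using its default 512-byte block size (no
'bs=' was given on the command line above); the variable names are only
for illustration.

```python
SECTOR_BYTES = 512        # assumption: dd's default block size

records = 4197405         # "records in" from the dd summary
elapsed_secs = 4885.573   # elapsed time from the dd summary

total_bytes = records * SECTOR_BYTES
rate = total_bytes / elapsed_secs   # bytes per second

print(total_bytes)        # matches the byte count dd reported
print(int(rate))          # matches the bytes/sec figure dd reported
print(rate / 1024)        # the same rate expressed in KiB/s
```

That works out to roughly 430 KiB/s for a sequential raw read of the whole
disk.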

So, one more reboot to restart the 'make build' again....

This time the disk syncing failed but the reboot didn't hang....


I wonder if this problem is the same or similar to the one Jim Bernard
has been reporting on port-sparc under the heading "SCSI probs on spork 10."


In any case it seems to me that the 'esp' driver hasn't kept up with the
new scsipi times.....

-- 
							Greg A. Woods

+1 416 218-0098      VE3TCP      <gwoods@acm.org>     <woods@robohack.ca>
Planix, Inc. <woods@planix.com>;   Secrets of the Weird <woods@weird.com>