Subject: sd diagnostics
To: None <port-sparc@NetBSD.ORG>
From: Greg Earle <earle@isolar.Tujunga.CA.US>
List: port-sparc
Date: 09/09/1996 17:38:46
Anyone help me decipher these?

sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 07 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 15 ef d0 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 16 38 d6 02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 0f 98 00 a1 00 00 00 00 02 08 16 38 d6 02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

[9 minutes later:]

sd3(esp0:2:0): non-media hardware failure, data = 20 08 88 08 d0 1f 00 00 04 40 07 98 00 a1 00 00 00 00 02 08 00 00 10 10 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

[12 seconds later:]

sd3(esp0:2:0): esp0: timed out [ecb 0xf86538f8 (flags 0x0, dleft 200, stat 0)], <state 3, nexus 0xf86538f8, phase(c 100, p 101), resid 200, msg(q 0,o 0) >
sd0(esp0:3:0): esp0: timed out [ecb 0xf86538cc (flags 0x2, dleft 800, stat 0)], <state 3, nexus 0xf86538f8, phase(c 100, p 101), resid 200, msg(q 0,o 0) >
sd3(esp0:2:0): esp0: timed out [ecb 0xf86538f8 (flags 0x10, dleft 200, stat 0)], <state 3, nexus 0xf86538f8, phase(c 100, p 101), resid 200, msg(q 0,o 0) > AGAIN
esp0: SCSI bus reset
sd3(esp0:2:0): max sync rate 4.80Mb/s
sd3(esp0:2:0): target put in SYNC mode

At boot time, this disk was probed curiously:

esp0 at sbus0 slot 0 offset 0x800000 pri 3: ESP100A 25Mhz, target 7
scsibus0 at esp0
sd1 at scsibus0 targ 0 lun 0: <FUJITSU, M2266S-512, 001F> SCSI1 0/direct fixed
sd1: 1029MB, 1658 cyl, 15 head, 84 sec, 512 bytes/sec
sd2 at scsibus0 targ 1 lun 0: <FUJITSU, M2266S-512, 001F> SCSI1 0/direct fixed
sd2: 1029MB, 1658 cyl, 15 head, 84 sec, 512 bytes/sec
probe(esp0:2:0): max sync rate 4.80Mb/s
probe(esp0:2:0): target put in SYNC mode
sd3 at scsibus0 targ 2 lun 0: <FUJITSU, M2266S-512, 001F> SCSI1 0/direct fixed
sd3: 1029MB, 1658 cyl, 15 head, 84 sec, 512 bytes/sec
sd0 at scsibus0 targ 3 lun 0: <SEAGATE, ST1480   SUN0424, 5826> SCSI2 0/direct f
ixed
sd0: 411MB, 1476 cyl, 9 head, 63 sec, 512 bytes/sec

Notice that only sd3 was probed with a 4.80 Mb/sec sync rate, even though I
have two other disks which are the same exact physically identical drive.

I'm curious as to whether these messages point to the disk itself (sd3) or to
the communication on the SCSI bus (i.e., maybe a cable needs to be replaced or
shortened, etc.).  Since they're not the same exact format as the SunOS 4.1.x
esp messages I'm used to deciphering, it's not obvious to me.  Thanks.

	- Greg