Subject: two disks, two controlers, same bad block
To: None <tech-kern@netbsd.org>
From: George Georgalis <george@galis.org>
List: tech-kern
Date: 03/27/2006 00:40:20
looks like a bad block on the disk... 268435455

Mar 26 23:18:38 trot /netbsd: wd1e: error writing fsbn 268435392 of 268435392-268435423 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:18:38 trot /netbsd: wd1: (id not found)
Mar 26 23:18:39 trot /netbsd: wd1e: error writing fsbn 268435392 of 268435392-268435423 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:18:39 trot /netbsd: wd1: (id not found)
Mar 26 23:18:40 trot /netbsd: wd1e: error writing fsbn 268435392 of 268435392-268435423 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:18:40 trot /netbsd: wd1: (id not found)
Mar 26 23:18:40 trot /netbsd: wd1e: error writing fsbn 268435392 of 268435392-268435423 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:18:40 trot /netbsd: wd1: (id not found)
Mar 26 23:18:41 trot /netbsd: wd1e: error writing fsbn 268435392 of 268435392-268435423 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:18:41 trot /netbsd: wd1: (id not found)
Mar 26 23:18:42 trot /netbsd: wd1e: error writing fsbn 268435392 of 268435392-268435423 (wd1 bn 268435455; cn 266305 tn 0 sn 15)wd1: (id not found)
Mar 26 23:18:42 trot /netbsd:
Mar 26 23:48:05 trot /netbsd: wd1d: error reading fsbn 268435455 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:48:05 trot /netbsd: wd1: (id not found)
Mar 26 23:48:05 trot /netbsd: wd1d: error reading fsbn 268435455 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:48:05 trot /netbsd: wd1: (id not found)
Mar 26 23:48:06 trot /netbsd: wd1d: error reading fsbn 268435455 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:48:06 trot /netbsd: wd1: (id not found)
Mar 26 23:48:07 trot /netbsd: wd1d: error reading fsbn 268435455 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:48:07 trot /netbsd: wd1: (id not found)
Mar 26 23:48:07 trot /netbsd: wd1d: error reading fsbn 268435455 (wd1 bn 268435455; cn 266305 tn 0 sn 15), retrying
Mar 26 23:48:07 trot /netbsd: wd1: (id not found)
Mar 26 23:48:08 trot /netbsd: wd1d: error reading fsbn 268435455 (wd1 bn 268435455; cn 266305 tn 0 sn 15)wd1: (id not found)
Mar 26 23:48:08 trot /netbsd:

however, it happens to be the exact same block which another disk
failed today, on a different controller...


Mar 26 19:47:32 trot /netbsd: satalink0 at pci0 dev 6 function 0
Mar 26 19:47:32 trot /netbsd: satalink0: Silicon Image SATALink 3112 (rev. 0x02)
Mar 26 19:47:32 trot /netbsd: satalink0: SATALink BA5 register space disabled
Mar 26 19:47:32 trot /netbsd: satalink0: bus-master DMA support present
Mar 26 19:47:32 trot /netbsd: satalink0: primary channel wired to native-PCI mode
Mar 26 19:47:32 trot /netbsd: satalink0: using irq 11 for native-PCI interrupt
Mar 26 19:47:32 trot /netbsd: atabus0 at satalink0 channel 0
Mar 26 19:47:32 trot /netbsd: satalink0: secondary channel wired to native-PCI mode
Mar 26 19:47:32 trot /netbsd: atabus1 at satalink0 channel 1

Mar 26 19:47:32 trot /netbsd: viaide0 at pci0 dev 15 function 0
Mar 26 19:47:32 trot /netbsd: viaide0: VIA Technologies VT8237 SATA Controller (rev. 0x80)
Mar 26 19:47:32 trot /netbsd: viaide0: bus-master DMA support present
Mar 26 19:47:32 trot /netbsd: viaide0: primary channel wired to native-PCI mode
Mar 26 19:47:32 trot /netbsd: viaide0: using irq 11 for native-PCI interrupt
Mar 26 19:47:32 trot /netbsd: atabus2 at viaide0 channel 0
Mar 26 19:47:32 trot /netbsd: viaide0: secondary channel wired to native-PCI mode
Mar 26 19:47:32 trot /netbsd: atabus3 at viaide0 channel 1


wd0 and wd1 are on the SATALink 3112, and wd2 is on the VT8237.
these are the commands and subsequent errors... note same block
268435455

 rsync ...

wd2e: error reading fsbn 268435392 of 268435392-268435423 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2e: error reading fsbn 268435392 of 268435392-268435423 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2e: error reading fsbn 268435392 of 268435392-268435423 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2e: error reading fsbn 268435392 of 268435392-268435423 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2e: error reading fsbn 268435392 of 268435392-268435423 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2e: error reading fsbn 268435392 of 268435392-268435423 (wd2 bn 268435455; cn 266305 tn 0 sn 15)wd2: (id not found)

 dd if=/dev/rwd2d of=~/wd2.268435455 count=1 skip=268435455
dd: /dev/rwd2d: Input/output error
0+0 records in
0+0 records out
0 bytes transferred in 3.394 secs (0 bytes/sec)

wd2d: error reading fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error reading fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error reading fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error reading fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error reading fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error reading fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15)wd2: (id not found)

 dd if=/dev/zero of=/dev/rwd2d seek=268435455 count=1
dd: /dev/rwd2d: Input/output error
1+0 records in
0+0 records out
0 bytes transferred in 3.382 secs (0 bytes/sec)

wd2d: error writing fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error writing fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error writing fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error writing fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error writing fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15), retrying
wd2: (id not found)
wd2d: error writing fsbn 268435455 (wd2 bn 268435455; cn 266305 tn 0 sn 15)wd2: (id not found)


wd1 above is a Seagate ST3400832A8 (400GB)
and wd2 is a ST3500641AS (500GB)

The kernel is NetBSD 3.0 (GENERIC), any idea what's wrong here?

// George


-- 
George Georgalis, systems architect, administrator <IXOYE><
http://galis.org/ cell:646-331-2027 mailto:george@galis.org