Subject: Re: ccd problem?
To: Ray Phillips <r.phillips@jkmrc.uq.edu.au>
From: Manuel Bouyer <bouyer@antioche.eu.org>
List: port-i386
Date: 12/16/2002 21:45:35
On Sat, Dec 14, 2002 at 02:34:45PM +1000, Ray Phillips wrote:
> I'm running NetBSD 1.6 on a Pentium II 350 MHz machine with two IDE
> disks (one's a master on the primary controller, the other's a master
> on the secondary), both of which are practically new. The system had
> been behaving nicely and I was almost ready to put it into production
> use.
>
> However, I thought I'd make use of three old SCSI disks so I used ccd
> to concatenate them with no striping. (My intention was to connect
> them all to a single SCSI controller when I got a ribbon cable with
> four connectors, but for the time being I connected one disk to one
> controller and two to another.) I made sure to offset the ccd
> partition on each disk by one cylinder, and made their filesystem
> type ccd. The contents of /etc/ccd.conf were:
>
> # ccd ileave flags component devices
> ccd0 0 none /dev/sd0e /dev/sd1e /dev/sd2e
>
> Executing 'ccdconfig -vC' seemed to work, as did 'newfs
> /dev/rccd0e'. I mounted the partition with 'mount /dev/ccd0e /mnt',
> copied a directory (named photos, about 550 MB in size) to /mnt and
> ran 'diff -r photos /mnt/photos' which said one of the binary files
> was different but it didn't complete its tests because the system
> froze and I had to power cycle it. These messages were on the
> console before I did:
>
> sd1(trm0:0:1:0): SCSI OpCode 0x0a timed out
> trm0: over/under run error
> sd1(trm0:0:1:0): generic HBA error
> ccd0: error 5 on component 1
>
> Now when I power up the machine (and during use) errors like this
> appear on the console:
>
> wd0: transfer error, downgrading to Ultra-DMA mode 1
> wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 1 (using DMA data
> transfers)
> wd0a: error reading fsbn 1051232 of 1051232-1051233 (wd0 bn 1051295;
> cn 1042 tn 15 sn 14), retrying
> wd0: (uncorrectable data error)
> wd0: transfer error, downgrading to DMA mode 2
> wd0(pciide0:0:0): using PIO mode 4, DMA mode 2 (using DMA data transfers)
> wd0a: error reading fsbn 1051232 of 1051232-1051233 (wd0 bn 1051295;
> cn 1042 tn 15 sn 14), retrying
> wd0e: error reading fsbn 41177552 of 41177552-41177567 (wd0 bn
> 46465520; cn 46096 tn 11 sn 59), retrying
> wd0: (uncorrectable data error)
> wd0: soft error (corrected)
>
> Do you think the fact that these errors started immediately I began
> to use a ccd partition was just a bizarre coincidence or are they
> linked somehow? The impression I got from scanning the mail archives
> is ccd is mature and robust these days.
Are your SCSI disks internal or external ?
If internal it's possible that your power supply it too short.
--
Manuel Bouyer <bouyer@antioche.eu.org>
NetBSD: 23 ans d'experience feront toujours la difference
--