Subject: problems with raidframe on FC disks
To: None <current-users@netbsd.org>
From: Ben Rosengart <br@panix.com>
List: current-users
Date: 02/27/2002 15:45:33
--envbJBWh7q8WU6mo
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline

I've been using RAIDframe on NetBSD 1.5.2 to stripe eight fibre
channel disks.  This has been working OK, modulo some soft errors
which didn't seem to actually cause any problems.

Yesterday, I switched to a NetBSD-1.5ZA kernel, using yesterday's
sources, in order to solve another problem which I believe to be
unrelated.  Overnight, the machine became unresponsive, with this
in syslog:

Feb 27 03:27:27 reader1 /netbsd: raid1: IO Error.  Marking /dev/sd4a as failed.
Feb 27 03:27:27 reader1 /netbsd: raid1: node (R  ) returned fail,
   rolling backward
Feb 27 03:27:27 reader1 /netbsd: raid1: DAG failure: r addr 0x153c580
   (22267264) nblk 0x20 (32) buf 0xc93c1000
Feb 27 03:31:43 reader1 /netbsd: raid1: IO Error.  Marking /dev/sd3a as failed.
Feb 27 03:31:43 reader1 /netbsd: raid1: node (W  ) returned fail, rolling
   forward
Feb 27 03:31:43 reader1 /netbsd: raid1: IO Error.  Marking /dev/sd6a as failed.
Feb 27 03:31:43 reader1 /netbsd: raid1: node (R  ) returned fail,
   rolling backward
Feb 27 03:31:43 reader1 /netbsd: raid1: IO Error.  Marking /dev/sd7a as failed.
Feb 27 03:31:43 reader1 /netbsd: raid1: node (R  ) returned fail,
   rolling backward
Feb 27 03:32:47 reader1 /netbsd: raid1: node (R  ) returned fail,
   rolling backward
Feb 27 03:33:51 reader1 last message repeated 3 times
Feb 27 03:47:25 reader1 /netbsd: Failed to write RAID component info!
Feb 27 03:53:22 reader1 /netbsd: sd1(isp0:0:57:0): adapter resource shortage

I'm wondering what the source of the IO errors from RAIDframe can
be, when the kernel is not reporting disk errors.

dmesg output from the machine in question is attached.

-- 
Ben Rosengart     (212) 741-4400 x215

"For values of 'whatever'."

--envbJBWh7q8WU6mo
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename="dmesg.boot"

NetBSD 1.5ZA (PANIX-STD) #0: Tue Feb 26 18:59:50 EST 2002
    root@trinity.nyc.access.net:/devel/netbsd/current-20020226/src/sys/arch/i386/compile/PANIX-STD
cpu0: AMD Athlon Model 6 (Palomino) (686-class), 1410.35 MHz
cpu0: I-cache 64 KB 64b/line 2-way, D-cache 64 KB 64b/line 2-way
cpu0: L2 cache 256 KB 64b/line 16-way
cpu0: features 383f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR>
cpu0: features 383f9ff<PGE,MCA,CMOV,FGPAT,PSE36,MMX,FXSR,SSE>
total memory = 1023 MB
avail memory = 947 MB
using 6144 buffers containing 52504 KB of memory
BIOS32 rev. 0 found at 0xf0bf0
mainbus0 (root)
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled
pchb0 at pci0 dev 0 function 0
pchb0: VIA Technologies VT8366 (Apollo KT266) CPU-PCI Bridge (rev. 0x00)
agp at pchb0 not configured
ppb0 at pci0 dev 1 function 0: VIA Technologies VT8366 CPU-AGP (rev. 0x00)
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled
vga1 at pci1 dev 0 function 0: S3 Savage4 (rev. 0x03)
wsdisplay0 at vga1 (kbdmux ignored)
C-Media Electronics Inc CMI8738/C3DX PCI Audio Device (audio multimedia, revision 0x10) at pci0 dev 5 function 0 not configured
pciide0 at pci0 dev 6 function 0: Promise Ultra100/ATA Bus Master IDE Accelerator (rev. 0x02)
pciide0: bus-master DMA support present
pciide0: primary channel configured to native-PCI mode
pciide0: using irq 5 for native-PCI interrupt
pciide0: disabling primary channel (no drives)
pciide0: secondary channel configured to native-PCI mode
pciide0: disabling secondary channel (no drives)
isp0 at pci0 dev 15 function 0
isp0: interrupting at irq 12
scsibus0 at isp0: 256 targets, 8 luns per target
fxp0 at pci0 dev 16 function 0: Intel i82557 Ethernet, rev 2
fxp0: interrupting at irq 5
fxp0: Ethernet address 00:a0:c9:6e:9b:cd
inphy0 at fxp0 phy 1: i82555 10/100 media interface, rev. 0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pcib0 at pci0 dev 17 function 0
pcib0: VIA Technologies VT8233 PCI-ISA Bridge (rev. 0x00)
pciide1 at pci0 dev 17 function 1: VIA Technologies VT8233 ATA100 controller
pciide1: bus-master DMA support present
pciide1: primary channel configured to compatibility mode
wd0 at pciide1 channel 0 drive 0: <ST360021A>
wd0: drive supports 16-sector PIO transfers, LBA addressing
wd0: 57241 MB, 16383 cyl, 16 head, 63 sec, 512 bytes/sect x 117231408 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
pciide1: primary channel interrupting at irq 14
wd0(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
pciide1: secondary channel configured to compatibility mode
wd1 at pciide1 channel 1 drive 0: <ST360021A>
wd1: drive supports 16-sector PIO transfers, LBA addressing
wd1: 57241 MB, 16383 cyl, 16 head, 63 sec, 512 bytes/sect x 117231408 sectors
wd1: 32-bit data port
wd1: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
pciide1: secondary channel interrupting at irq 15
wd1(pciide1:1:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA data transfers)
uhci0 at pci0 dev 17 function 2: VIA Technologies VT83C572 USB Controller (rev. 0x1b)
uhci0: interrupting at irq 5
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: VIA Technologie UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1 at pci0 dev 17 function 3: VIA Technologies VT83C572 USB Controller (rev. 0x1b)
uhci1: interrupting at irq 5
usb1 at uhci1: USB revision 1.0
uhub1 at usb1
uhub1: VIA Technologie UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2 at pci0 dev 17 function 4: VIA Technologies VT83C572 USB Controller (rev. 0x1b)
uhci2: interrupting at irq 5
usb2 at uhci2: USB revision 1.0
uhub2 at usb2
uhub2: VIA Technologie UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
isa0 at pcib0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com0: console
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
pckbc0 at isa0 port 0x60-0x64
lpt0 at isa0 port 0x378-0x37b irq 7
pcppi0 at isa0 port 0x61
sysbeep0 at pcppi0
isapnp0 at isa0 port 0x279: ISA Plug 'n Play device support
npx0 at isa0 port 0xf0-0xff: using exception 16
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
isapnp0: no ISA Plug 'n Play devices found
apm0 at mainbus0: Power Management spec V1.2
APM power mgmt engage (device 1): power management disabled (0x10f)
biomask ff67 netmask ff67 ttymask ffe7
scsibus0: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 56 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd0: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
sd1 at scsibus0 target 57 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd1: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
sd2 at scsibus0 target 58 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd2: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
sd3 at scsibus0 target 59 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd3: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
sd4 at scsibus0 target 60 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd4: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
sd5 at scsibus0 target 61 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd5: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
sd6 at scsibus0 target 62 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd6: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
sd7 at scsibus0 target 63 lun 0: <SEAGATE, ST19171FC, FB59> SCSI2 0/direct fixed
sd7: 8683 MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
Kernelized RAIDframe activated
RAID autoconfigure
Configuring raid0:
RAIDFRAME: protectedSectors is 64
RAIDFRAME: Configure (RAID Level 1): total number of sectors is 8388480 (4095 MB)
RAIDFRAME(RAID Level 1): Using 6 floating recon bufs with no head sep limit
RAID autoconfigure
Configuring raid1:
RAIDFRAME: Configure (RAID Level 0): total number of sectors is 37631488 (18374 MB)
RAIDFRAME(RAID Level 0): Using 24 floating recon bufs with no head sep limit
uhub2: port error, restarting port 2
boot device: raid0
root on raid0a dumps on raid0b
root file system type: ffs
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x25, vt100 emulation)
wsdisplay0: screen 4 added (80x25, vt100 emulation)

--envbJBWh7q8WU6mo--