Subject: lost interrupt, device timeout...
To: None <netbsd-help@netbsd.org>
From: Jukka Salmi <j+nbsd@2004.salmi.ch>
List: netbsd-help
Date: 10/25/2004 18:13:28
Hello,

on a i386 system running -current (built 2004-10-18) the following
was logged today:

viaide0:0: lost interrupt
        type: ata tc_bcount: 16384 tc_skip: 0
viaide0:0:0: bus-master DMA error: missing interrupt, status=0x60
viaide0:0:0: device timeout, c_bcount=16384, c_skip0
wd0a: device timeout writing fsbn 38771488 of 38771488-38771519 (wd0 bn 38771551; cn 38463 tn 13 sn 28), retrying
wd0: soft error (corrected)
viaide0:0: lost interrupt
        type: ata tc_bcount: 16384 tc_skip: 0
viaide0:0:0: bus-master DMA error: missing interrupt, status=0x61
viaide0:0:0: device timeout, c_bcount=16384, c_skip0
wd0a: device timeout writing fsbn 31458560 of 31458560-31458591 (wd0 bn 31458623; cn 31208 tn 15 sn 14), retrying
wd0: soft error (corrected)

That box is running -current for a couple of months now and never showed
these messages before. However, several changes were made to the box
recently which could AFAICT cause the problem:

- Two weeks ago the power supply (350W) failed and was replaced with a
  250W supply. (I'll install the old one as soon it's repaired.)

- Yesterday I changed the order of attached IDE devices:
  * Before: two harddisks on channel 0 (as primary master and primary
    slave), a CD reader and a writer on channel 1 (as secondary master
    and slave).
  * Now: a harddisk (as master) and a CD (as slave) on each channel.

- After this I configured RAID level 1 (RAIDframe) using all of wd0 and
  wd1.

I guess the power supply is not the culprit. Assuming the same applies
to RAIDframe: is it possible these errors are because of how I connected
the devices? Or is there something wrong with wd0?


Comments and hints are appreciated!

TIA, Jukka

-- 
bashian roulette:
$ ((RANDOM%6)) || rm -rf ~