Subject: Re: Hang (SCSI-related?) on 1.3.2
To: Manuel Bouyer <bouyer@antioche.lip6.fr>
From: Gunnar Helliesen <gunnar@bitcon.no>
List: port-i386
Date: 01/13/2000 17:02:45
On Mon, 10 Jan 2000, Manuel Bouyer wrote:

> On Sun, Jan 09, 2000 at 07:26:48PM +0100, Gunnar Helliesen wrote:
> > Does this indicate that sd0 is going bad?
> 
> Well, maybe, if it's not a problem on the scsi chain.
> Don't you have/had overheating problems ?

Not that I know of, no. Both fans are running and the "exhaust" is about
the same temperature as on the other servers (of the same type) that I
have.

The machine just crashed again (half hour ago), but this time it was a new
variant. NetBSD panicked,  but didn't reboot. I found this on the console:


panic: ahc0: Timed-out command times out again

syncing disks...


and there it hung. Shouldn't there be a timeout in the "syncing
disks" routine to make sure that the system does indeed reboot in case of
problems with the disks (which would sometimes, as in this case, make
syncing the disks impossible anyway)?

As usual the machine rebooted and came right back up again with no
complaints (except running fsck of course) as soon as I hit the reset
switch. If there was an overheating problem, wouldn't you expect the disks
or SCSI bus to fail pretty consistently once it had reached a certain
temperature?

Gunnar

--
Gunnar Helliesen   | Bergen IT Consult AS  | NetBSD/VAX on a uVAX II
Systems Consultant | Bergen, Norway        | '86 Jaguar Sovereign 4.2
gunnar@bitcon.no   | http://www.bitcon.no/ | '73 Mercedes 280 (240D)