Subject: Re: raidframe problems (revisited)
To: Matthias Scheler <tron@zhadum.org.uk>
From: Greg Oster <oster@cs.usask.ca>
List: netbsd-users
Date: 06/04/2007 09:06:19
Matthias Scheler writes:
> On Fri, Jun 01, 2007 at 11:02:07AM -0600, Greg Oster wrote:
> > You say the machine freezes -- it does that when you attempt to do a 
> > 'raidctl -F' or 'raidctl -R', yes?  Can you get into ddb at that 
> > point?  Can you ping the box at that point?  My next guess is that 
> > it's a kernel memory issue of some sort... 
> 
> And it seem you were right about that. After I increased the available
> kernel memory from 128MB (default for a 1GB machine) to 256MB I could
> rebuild the RAID without the machine getting stuck:
> 
> raid0: Reconstruction of disk at col 0 completed
> raid0: Recon time was 7976.858062 seconds, accumulated XOR time was 0 us (0.0
> 00000)
> raid0:  (start time 1180943157 sec 172287 usec, end time 1180951134 sec 30349
>  usec)
> raid0: Total head-sep stall count was 0
> raid0: 6678754 recon event waits, 4 recon delays
> raid0: 3244449136 max exec ticks
> 
> The machine is still up and running.

Interesting!!!  So I wonder if it's a case where RAIDframe gets its 
memory, and then something else runs out and is unhappy, or if 
RAIDframe is running out (for a rebuild, it should be willing to wait 
for memory to come free)...

Later...

Greg Oster