Subject: Re: anyone know if there's a fix for this "malloc with held simple_lock" in RAIDframe bug yet?
To: Greg A. Woods <woods@weird.com>
From: Greg Oster <oster@cs.usask.ca>
List: tech-kern
Date: 03/14/2005 09:37:39
"Greg A. Woods" writes:
> I was just trying to set up a RAID-1 mirror of the root drives on an
> alphaserver (with a 1.6.x MP kernel) and the instant I ran "raidctl -C"
> the following spewed forth on the console.
>
> malloc with held simple_lock 0xfffffc00006135c0 CPU 1 /building/work/woods/m-
> NetBSD-1.6/sys/dev/raidframe/rf_driver.c:356
> alpha trace requires known PC =eject=
[snip]
>
> I guess I'm not going to be mirroring the system disk just yet.....
>
>
> In any case does anyone know if there's a fix already in -current for
> this "malloc with held simple_lock" bug yet, and if so where I might
> hope to find it?
All the LOCKDEBUG issues should be fixed in 2.0 (and thus in -current).
> The only difference I see in the -current in that area is that the
> RF_LOCK_MUTEX(configureMutex) call has been changed to be
> RF_LOCK_LKMGR_MUTEX(configureMutex). That may be the fix according to
> the comment associated with the change (rev. 1.55, and yes my kernel
> does also use LOCKDEBUG), but I'd like to get some confirmation before I
> try pulling that change into my own sources.
The change in rev 1.55 may fix this problem, but IIRC there were
quite a few more changes that had to be made before all (?) of the
locking issues were sorted out. (You'll need at least 1.64 and 1.65
for this file, and probably a whole mess of other changes for other
files in RAIDframeland)
If you're looking for "the best RAIDframe", might I recommend you
use 2.0? :) (The code in 2.0 is MUCH better than what shipped in
1.6.x)
Later...
Greg Oster