Subject: Re: raidframe copyback blocks the whole system !?
To: None <kilbi@rad.rwth-aachen.de>
From: Greg Oster <oster@cs.usask.ca>
List: current-users
Date: 06/25/2002 16:47:43
Markus W Kilbinger writes:
> >>>>> "Greg" == Greg Oster <oster@cs.usask.ca> writes:
> 
>     Greg> Well... if you have hot-swap drives, then you could just
>     Greg> stuff the new drive in place of the old one, and not worry
>     Greg> about doing a copyback.. (i.e. the new drive you put in
>     Greg> becomes the hot spare).
>     >> 
>     >> Hmm, how to come!? The old/new drive still has state 'failed',
>     >> the spare is 'used_spare'. So, what will happen if another
>     >> drive fails in this stage?
> 
>     Greg> You stuff another drive into the box, hot-add it, and
>     Greg> rebuild to it :)
> 
> Let's proceed: ;-)
> 
> I just can place in a new drive in place of a failed one. -> A known,
> but 'failed' scsi id / /dev/sd.. refers to a fresh drive now. ->
> Adding this as a new spare ('raidctl -a /dev/sd..', I guess) would
> work!? (I wouldn't even dare...)

Hmm... I don't think I've ever tried exactly what you suggest :) 
I have put a fresh drive (with a SCSI ID different from all other drives, 
including the failed drive) into the system, and hot-added that... 
I'm not sure if it would let you hot-add /dev/sd1e even after /dev/sd1e 
failed, was removed, and was added back in again... (if the device was closed, 
it might, but otherwise it wouldn't...)

>     >> How will the 'failed' drive become the new spare _without_
>     >> reboot?
> 
>     Greg> You stuff a fresh drive into that slot, and when a different
>     Greg> drive fails, rebuild to the fresh drive. Not quite the same
>     Greg> as a spare, but works much the same. :)
> 
> But then you've lost the real spare / redundancy effect, because then
> the first reconstruction step (onto the spare drive), normally done
> automatically, has to be done manually, now.

Except the first reconstruction isn't done automatically right now either :-}
 
> -> Then it would be better to handle the complete spare drives
> manually, because there are only smooth reconstructions. Having a
> 'used_spare' means either blocking copyback or a (reordering) reboot.
> 
> Or is 'raidctl -C ...' allowed with a used running raid!? (Another
> thing I wouldn't dare! ;-))

No.. you don't want to be doing 'raidctl -C ...'s after you get a set 
configured, and it won't let you do 'raidctl -C' when it's already running.
(You really only want to do the 'raidctl -C' the first time, and then let 
the autoconfig stuff do its magic after that :) )

Later...

Greg Oster