NetBSD-Users archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Netbsd 5.0RC2 amd64, system raid fail to reconstruct



Hi guys,

After a power failure my 2 raid became out of sync.
I use Netbsd 5.0RC2 amd64 kernel and a 4.99.72 userland.

First raid (raid0) is the system raid it uses 2 drives in raid level 1
Second raid (raid1) is data raid it uses 3 drives in raid level 5

For each raid, one component failed and has a different modification counter.

On the second raid (raid1) I:
   - umount all related mount points
   - raidctl -u raid1
   - dd if=/dev/zero of=/dev/rwd2d bs=8k count=1
   - fdisk -0u /dev/rw2d
   - disklabel -R -r disklabel.wd2 wd2
   - redef raid using an absent component instead of the one out of
sync (raidctl -C)
   - raidctl -I 200903191 raid1
   - raidctl -A yes raid1
   - shutdown -r now
   - raidctl -a /dev/wd2e raid1
   - raidctl -F component0 raid1
   - shutdown -r now

All was OK

Then I tryed with the system raid that I can't undef because my root is on raid0
I used jibbed liveCD that use netbsd 5.0RC1 amd64
I did the same thing:
   - raidctl -A no raid0
   - raidctl -u raid0
   - dd if=/dev/zero of=/dev/rwd1d bs=8k count=1
   - fdisk -0ua /dev/rw1d
   - disklabel -R -r disklabel.wd1 wd1
   - redef raid using an absent component instead of the one out of
sync (raidctl -C)
   - raidctl -I 200903201 raid0
   - raidctl -A root raid0
   - shutdown -r now
reboot on the local system Netbsd 5.0RC2 amd64
   - raidctl -a /dev/wd1a raid1
   - raidctl -F component1 raid1

The reconstruction failed with
raid0: Recon read failed: 5

raidctl -s raid0 while reconstructing showed me the old raid
configuration on /dev/wd1a

Then when finished:
poire# raidctl -s raid0
Components:
           /dev/wd0a: optimal
          component1: failed
Spares:
           /dev/wd1a: spare
Component label for /dev/wd0a:
   Row: 0, Column: 0, Num Rows: 1, Num Columns: 2
   Version: 2, Serial Number: 200903201, Mod Counter: 963
   Clean: No, Status: 0
   sectPerSU: 128, SUsPerPU: 1, SUsPerRU: 1
   Queue size: 100, blocksize: 512, numBlocks: 625142272
   RAID Level: 1
   Autoconfig: Yes
   Root partition: Yes
   Last configured as: raid0
component1 status is: failed.  Skipping label.
/dev/wd1a status is: spare.  Skipping label.
Parity status: DIRTY
Reconstruction is 100% complete.
Parity Re-write is 100% complete.
Copyback is 100% complete.

poire# raidctl -g /dev/wd1a raid0
Component label for /dev/wd1a:
   Row: 0, Column: 1, Num Rows: 1, Num Columns: 2
   Version: 2, Serial Number: 200710070, Mod Counter: 829
   Clean: No, Status: 0
   sectPerSU: 128, SUsPerPU: 1, SUsPerRU: 1
   Queue size: 100, blocksize: 512, numBlocks: 625142272
   RAID Level: 1
   Autoconfig: No
   Root partition: No
   Last configured as: raid7


Did I miss something ?

Thomas


Home | Main Index | Thread Index | Old Index