NetBSD-Users archive


Re: Problem with raidframe under NetBSD-3 and NetBSD-4



        Hello Greg.  I have two machines with mirrored 1TB disks installed.
Here is the output from dmesg concerning the raid on the working machine.
Note that I've not actually tried rebuilding a replacement disk on this
machine; I just built it with both disks present initially.
        The raid was built under NetBSD-3.1, but is now running 4.0-stable.

Good machine:
raid0: RAID Level 1
raid0: Components: /dev/wd0a /dev/wd1a
raid0: Total Sectors: 1953524992 (953869 MB)
boot device: raid0
root on raid0a dumps on raid0b
root file system type: ffs
raid0: Device already configured!

        Now, contrast that with the broken machine:

raid0: RAID Level 1
raid0: Components: component0[**FAILED**] /dev/wd0a
raid0: Total Sectors: 1953524992 (953869 MB)
boot device: raid0
root on raid0a dumps on raid0b
root file system type: ffs
        Unfortunately, I don't have what I think you want to see.  That is, I
don't have the messages that came from the kernel after I initiated the
copyback but before I rebooted the machine.  You're correct that the
machine doesn't reboot, but begins acting strangely, as all the processes
line up to get stuff off the disks, which no longer return data.
Eventually, all processes end up in disk wait.
        I was wondering about the size of these individual disks, and
whether I was running into some size limitation, since what seems to be
happening, as I read the source, is that the reconstruction starts but
then fails silently for some reason.  Or, at least, I can't see the failure
message, because dmesg stops returning messages shortly after this process
begins.  Worse, after the failure occurs, the raid system fails to unlock
the raid set, meaning no other processes can get to it.  Unless I'm
mistaken (a very likely possibility, since I'm not familiar with this code),
it looks to me like there are several places in the raidioctl() function in
rf_netbsdkintf.c where a malloc occurs and, if the malloc fails, an error
is returned without unlocking the raid mutex.  If that's true in
raidioctl(), I wonder if it could be true in other places in the code.
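        To make the pattern I think I'm seeing concrete, here's a small,
self-contained illustration of that kind of error path.  This is not the
actual rf_netbsdkintf.c code (the lock, the function names, and the
allocation are all stand-ins of mine); it just shows the shape of the bug
I suspect, and what the fix would presumably look like:

#include <errno.h>
#include <pthread.h>
#include <stdlib.h>

static pthread_mutex_t raid_mutex = PTHREAD_MUTEX_INITIALIZER;

/* Suspected pattern: bail out on allocation failure with the lock held. */
int
leaky_ioctl_path(size_t len)
{
	void *buf;

	pthread_mutex_lock(&raid_mutex);

	buf = malloc(len);
	if (buf == NULL)
		return ENOMEM;		/* BUG: raid_mutex is still held */

	/* ... do the real work here ... */

	free(buf);
	pthread_mutex_unlock(&raid_mutex);	/* only reached on success */
	return 0;
}

/* Presumed fix: release the lock on every error path before returning. */
int
fixed_ioctl_path(size_t len)
{
	void *buf;

	pthread_mutex_lock(&raid_mutex);

	buf = malloc(len);
	if (buf == NULL) {
		pthread_mutex_unlock(&raid_mutex);
		return ENOMEM;
	}

	/* ... do the real work here ... */

	free(buf);
	pthread_mutex_unlock(&raid_mutex);
	return 0;
}

If the ioctl path really does return early the way the first function does,
every later caller would block on the lock forever, which matches the way
everything here ends up stuck in disk wait.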
        Another question: has anyone else set up raid sets with disks this
large?
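        To put a rough number on that size question: the total sector count
here, 1953525168, still fits in a signed 32-bit integer, but intermediate
products of it do not.  Here's a trivial sketch, with variable names of my
own invention rather than anything taken from RAIDframe, of the kind of
32-bit wraparound I'm worried about (it assumes the usual 32-bit int):

#include <inttypes.h>
#include <stdio.h>

int
main(void)
{
	uint32_t numBlocks = 1953524992;	/* sectors in this raid set */
	uint32_t sectorSize = 512;

	/* The product is computed in 32 bits and wraps before widening. */
	uint64_t wrapped = numBlocks * sectorSize;

	/* Widening one operand first gives the real byte count. */
	uint64_t correct = (uint64_t)numBlocks * sectorSize;

	printf("wrapped: %" PRIu64 "\n", wrapped);
	printf("correct: %" PRIu64 "\n", correct);
	return 0;
}

If any of the stripe or byte arithmetic on the reconstruction path does its
multiplication in 32 bits like the first calculation above, a disk this
size would be exactly where things start to go wrong.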

        Also, here is the disk label for the disks in the raid set in
question.

-thanks
-Brian

# /dev/rwd0d:
type: unknown
disk: st31000340ns
label: 
flags:
bytes/sector: 512
sectors/track: 63
tracks/cylinder: 16
sectors/cylinder: 1008
cylinders: 1938021
total sectors: 1953525168
rpm: 10000
interleave: 1
trackskew: 0
cylinderskew: 0
headswitch: 0           # microseconds
track-to-track seek: 0  # microseconds
drivedata: 0 

4 partitions:
#        size    offset     fstype [fsize bsize cpg/sgs]
 a: 1953525105        63       RAID                     # (Cyl.      0*- 1938020)
 c: 1953525105        63     unused      0     0        # (Cyl.      0*- 1938020)
 d: 1953525168         0     unused      0     0        # (Cyl.      0 - 1938020)
On Apr 4, 10:45am, Greg Oster wrote:
} Subject: Re: Problem with raidframe under NetBSD-3 and NetBSD-4
} Brian Buhrow writes:
} >     Hello.  I've been using raidframe with NetBSD for about 8 years with
} > great success.  Just today, I've run into a situation I've never seen
} > before, and I am not sure how to resolve it.
} > 
} >     I am running a NetBSD-3.1 system with a raid 1 mirrored root.  This
} > system is a Sunfire X2200 with 2 SATA disks mirrored together.
} >     All was working well, but I decided to image the system onto two new
} > drives of a smaller size, due to the fact that I'm using 1TB drives for a
} > system that requires less than 3GB of disk space.  In any case, the
} > procedure I thought I'd use was the following:
} > 
} > 1.  Use raidctl -f /dev/wd1a raid0 to fail the second drive and allow me to
} > pull it out of the system.
} > 
} > 2.  Insert the new disk and then label it and configure it as a second raid
} > device with a missing component.
} > 
} > 3.  Dump/restore the running image onto the new drive.
} > 
} > 4.  Load the new drive into the new machine, and add a second drive to
} > that.
} > 
} > 5.  Re-insert the original drive from step 1 and rebuild the raid to it.
} > 
} >     I got as far as step 5, but then ran into trouble.
} > 
} >     Due to some canoodling around on my part, it was necessary to reboot
} > the system a few times during these steps, so I ended up with a raid set
} > that looks like:
} > 
} > Components:
} >           component0: failed
} >            /dev/wd0a: optimal
} > No spares.
} > component0 status is: failed.  Skipping label.
} > Component label for /dev/wd0a:
} >    Row: 0, Column: 1, Num Rows: 1, Num Columns: 2
} >    Version: 2, Serial Number: 20071030, Mod Counter: 135
} >    Clean: No, Status: 0
} >    sectPerSU: 64, SUsPerPU: 1, SUsPerRU: 1
} >    Queue size: 100, blocksize: 512, numBlocks: 1953524992
} >    RAID Level: 1
} >    Autoconfig: Yes
} >    Root partition: Yes
} >    Last configured as: raid0
} > Parity status: DIRTY
} > Reconstruction is 100% complete.
} > Parity Re-write is 100% complete.
} > Copyback is 100% complete.
} > 
} >     "No problem", I thought.   I'll do:
} > 
} > raidctl -a /dev/wd1a raid0
} > <Works fine, the second disk shows up as a hot spare>
} > 
} > raidctl -F component0 raid0
} > <Here's where the trouble starts>
} > 
} >     What happens after this command is run is just strange.
} > First, all seems OK, I get the messages from the kernel about starting
} > reconstruction.  Then, a check of 
} > raidctl -S
} > shows that parity, copyback and reconstruction are all 100% complete.
} > (This was mere seconds after running the reconstruction command above.)
} 
} Could you send me any raid/disk-related contents of /var/log/messages 
} from around these times?
} 
} >     At this point, I notice that the system is starting to shut down,
} 
} You mean "shut down" as in "start acting strangely and beginning to 
} stop responding", not as in "starting to reboot...", right?
} 
} > and
} > that anything needing disk i/o hangs.  The raidctl -s command hangs on
} > the ps string raidframe.  The only way to recover is to power cycle the
} > machine.
} >
} >     I've tried a NetBSD-4 kernel with this setup, and it demonstrates the
} > same behavior.  I've verified that both sata ports are working properly and
} > at full speed.  I've also verified that both drives are in working order.
} >     What this seems like is that the still working drive of the raid got
} > corrupted in some way that fsck doesn't detect, but that raidframe
} > absolutely doesn't like. 
} 
} RAIDframe only reads/writes blocks -- it knows nothing about whether 
} things are corrupted, other than if it gets an IO error from an 
} underlying device...
} 
} > I'm willing to believe I made an error in my
} > procedure which is shooting me in the foot,
} 
} What you describe should work fine...
} 
} > but I'll be darned if I can
} > figure out how to recover from this situation.
} >     Has anyone else seen this problem, and if so, does anyone have
} > suggestions about what I might do to get my drives mirrored again?
} >     If it helps, I've got several other X2200 machines running with
} > similar configurations, and they're happy to build drives all day long
} > without a hitch.
} 
} "build drives" in what way?  A reconstruct is significantly different 
} from a "parity rewrite" in terms of the IO load -- the reconstruct is 
} going to be reading one disk and writing to another, while the 
} rewrite is going to be primarily just reading from both disks (and 
} writing only when it has to).
} 
} >  I don't get any errors from the disk drivers saying they
} > can't read or write a drive, and reading and writing both drives is working
} > fine in other contexts.  So, I think there's something wrong with the good
} > mirror, but I'm not sure what.
} > Any thoughts?
} 
} Oh.. hmmmmmmm.... 
} 
} requoting:
} >    Queue size: 100, blocksize: 512, numBlocks: 1953524992
} 
} I wonder with numBlocks that high if we're running into a 32-bit 
} overflow issue somewhere....  Likely related to the number of 
} stripes...  Hopefully your /var/log/messages has some additional 
} information that might point us in the right direction..
} 
} Later...
} 
} Greg Oster
} 
} 
>-- End of excerpt from Greg Oster



