Subject: Problems with NFS mount from rebooted Solaris/x86 boxes
To: None <netbsd-users@netbsd.org>
From: Paul J. Lavoie <pjl@ilx.com>
List: netbsd-users
Date: 05/21/2002 17:08:23
Here's an odd problem which has been plaguing me since the days of 1.3 that I
am now fed up with and am looking to get resolved...
I have a series of systems that mount various servers for easy access of log
information, and these servers get restarted every now and then. Every now and
then the box gets into an incoherent state and needs to be restarted due to
unresolvable NFS issues. All in all there are over 500 mounts in total on the
client system.
The client was originally configured using amd, but this seemed to be
particularly unstable, with the instability increasing from 1.3 to 1.4, but
seems a little better under 1.5 and -current. Trying to get to the bottom of
this, the amd maps were configured to fixed mounts, and the only mounts that
seem to be causing a real issue are those to a handful of Solaris/x86 boxes,
and in particular after the Solaris box has been rebooted.
Trying to see what traffic was being sent between the machines resulted in
seeing null rpc procedures being called every minute or so from the BSD to the
Solaris box with a successful return, but no luck figuring out where things
were wedged.
Does anyone have any suggestions on how to go troubleshooting this particular
setup? I'm willing to go digging into code, but would like a couple of
pointers before starting to hunt around...
Thanks.
-pjl
-------------------------------------------------------------------------------
Paul J. Lavoie pjl@ilx.com (212) 510-3029
ILX Systems, Inc. 111 Fulton St New York, NY 10038