Subject: Re: 2.0beta freezes
To: None <sigsegv@rambler.ru>
From: Sean Davis <dive@endersgame.net>
List: port-i386
Date: 09/12/2004 13:06:20
On Sun, Sep 12, 2004 at 04:48:56PM +0100, sigsegv@rambler.ru wrote:
> Jukka Marin wrote:
> 
> >I have experienced a few freezes running 2.0beta kernels.  A moment ago,
> >I was about to save an image from gimp and the system froze.  The X display
> >remained visible, but the mouse and keyboard were all dead (no caps lock
> >operation even).  I waited for some 15 minutes and the system didn't reboot
> >or recover.  All the time, I was able to ping the system from a remote
> >machine.
> >
> >I have seen this before.  I thought it was a disk problem, replaced the
> >disk with a new one (different make as well), no help.  Sometimes the
> >system freezes for about 10 seconds and then comes back alive, sometimes
> >I have to push the reset button to get it going again.
> >
> >This has been going on for months.  The system is usually stable, but every
> >now and then (one or twice a month) it freezes like this.  I think it 
> >happens
> >during heavy disk / memory load, but I'm not completely sure.
> >
> > -jm
> >
> >
> > 
> >
> I had similar problems, which were attributed to mounting filesytems 
> with softdep option. I did a simple test, downloaded a 40MB tar file 
> (e.g. mozilla.tar.gz) and then unpacked it many times over and over 
> again, sooner or later, the whole system would just freeze. Mounting all 
> my filesystems without softdep option made the problem go away. I am not 
> a kernel programmer, but from what I can see, softdep code in NetBSD may 
> have some serious bugs.

It does. It was stable for years, but recently (I believe since UBC
happened) it's returned to dont-trust-it-further-than-you-can-throw-it
status. I had close to a dozen panics all in the softdep code. Now that I've
pulled it from all my kernels, and run everything with normall ffsv1 no
softdep, I can load the hell out of any of my machines and they won't crash.

Also, I've had 2.0_BETA crash on sparc64 for no known reason quite a few
times - switched that machine to -current (Sun Ultra 1E Model 170, aka Sun
Ultra 1 Creator), and no crashes since.

I'm of the opinion that 2.0_BETA is way more _BETA than -current is ;-)

-Sean

--
/~\ The ASCII
\ / Ribbon Campaign                   Sean Davis
 X  Against HTML                       aka dive
/ \ Email!