NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: kern/37881: repeatable crash in filesystem when running MP



The following reply was made to PR kern/37881; it has been noted by GNATS.

From: Antti Kantee <pooka%cs.hut.fi@localhost>
To: dieter.NetBSD%pandora.be@localhost
Cc: gnats-bugs%NetBSD.org@localhost
Subject: Re: kern/37881: repeatable crash in filesystem when running MP
Date: Sun, 27 Jan 2008 15:06:20 +0200

 On Sun Jan 27 2008 at 12:40:00 +0000, dieter.NetBSD%pandora.be@localhost wrote:
 >      Since upgrading my system from 4.99.35 to 4.99.49 and then
 >      4.99.50, I can reproducibly crash it by copying some files
 >      with nautilus from a windows share to my local disk. The
 >      kernel drops in ddb after a uvm_fault with a backtrace like
 >      this:
 > 
 > uvm_fault(0xcd8b1004, 0, 1) -> 0xe
 > knote(cefe5be4,6,0,14c90000,0) at netbsd:knote+0x27
 > ffs_write(cdccbc04,0,c0483fa0,cefe5b20,10002) at netbsd:ffs_write+0x53d
 > VOP_WRITE(cefe5b20,cdccbc7c,10,cdb31540,c0579a84) at netbsd:VOP_WRITE+0x80
 > vn_write(cea71414,cea71440,cdccbc7c,cdb31540,1) at netbsd:vn_write+0xcc
 > dofilewrite(16,cea71414,ba226000,10000,cea71440) at netbsd:dofilewrite+0x75
 > sys_write(cef6eba0,cdccbd00,cdccbd28,ba235000,c037cab5) at 
 > netbsd:sys_write+0x9c
 > syscall(cdccbd48,b3,ba2600ab,ba40001f,ba5f001f) at netbsd:syscall+0xb5
 > 
 >      This does not happen if I disable one of the 2 cores with
 >      cpuctl.
 
 Hmm.  How are you copying the files?  Many at a time?  fd 16 looks
 semi-suspicious.
 
 > #25 0xc02c6727 in knote (list=0xcefe5be4, hint=6)
 >     at /usr/src/sys/kern/kern_event.c:1301
 
 print *list
 print *kn
 
 > #26 0xc025b9ed in ffs_write (v=0xcdccbc04)
 >     at /usr/src/sys/ufs/ufs/ufs_readwrite.c:507
 
 print *uio
 print *vp
 
 > #29 0xc0302cc5 in dofilewrite (fd=22, fp=0xcea71414, buf=0xba226000, 
 >     nbyte=65536, offset=0xcea71440, flags=1, retval=0xcdccbd28)
 >     at /usr/src/sys/kern/sys_generic.c:392
 
 print *offset
 
 print numvnodes
 print desiredvnodes
 
 >      Copy some (large?) files around with nautilus, possibly
 >      from an smb share on a system with more than one CPU active?
 
 Does it happen when you start copying or when you've been copying for
 a while or what?  Be as specific as you can.
 
 
 .. actually, if it's easy to repeat, try this patch, but do the above
 also.  I'm mostly guessing here, as I can't quite wrap my head around
 why this happens now and not before.
 
 Index: vfs_subr.c
 ===================================================================
 RCS file: /cvsroot/src/sys/kern/vfs_subr.c,v
 retrieving revision 1.323
 diff -p -u -r1.323 vfs_subr.c
 --- vfs_subr.c 26 Jan 2008 22:53:10 -0000      1.323
 +++ vfs_subr.c 27 Jan 2008 13:05:54 -0000
 @@ -263,6 +263,7 @@ try_nextlist:
                        spec_node_destroy(vp);
                }
                vp->v_type = VNON;
 +              SLIST_INIT(&vp->v_klist);
        } else {
                /*
                 * Don't return to freelist - the holder of the last
 
 -- 
 Antti Kantee <pooka%iki.fi@localhost>                     Of course he runs 
NetBSD
 http://www.iki.fi/pooka/                          http://www.NetBSD.org/
     "la qualité la plus indispensable du cuisinier est l'exactitude"
 



Home | Main Index | Thread Index | Old Index