Subject: Re: LFS not quite stable
To: Konrad Schroder <perseant@hhhh.org>
From: Gary Duzan <gary@wheel.tiac.net>
List: current-users
Date: 04/14/1999 21:37:46
In Message <Pine.NEB.3.96.990414141456.4152P-100000@hhhh.hitl.washington.edu> ,
   Konrad Schroder <perseant@hhhh.org> wrote:

=>I believe I've fixed LFS' locking problem (in -current as of
=>lfs_syscalls.c v1.30), and the lfs_ifind problem this past weekend ... let
=>me know if you still have any problems.  (Or if you don't, I'd be more
=>glad to hear that! ;^)
=>						Konrad Schroder
=>						perseant@hhhh.org

   Have a new crash after building a newly supped kernel:

NetBSD wheel 1.4_ALPHA NetBSD 1.4_ALPHA (WHEEL) #0: Wed Apr 14 13:28:23 EDT 1999     gary@wheel:/usr/src/sys/arch/i386/compile/WHEEL i386

wheel { /var/crash } # gdb netbsd.0
GNU gdb 4.17
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386--netbsd"...(no debugging symbols found)...
(gdb) target kcore netbsd.0.core
panic: lockmgr: locking against myself
#0  0xf01e2e43 in lfs_vref ()
(gdb) where
#0  0xf01e2e43 in lfs_vref ()
#1  0xf0200833 in cpu_reboot ()
#2  0xf0146779 in panic ()
#3  0xf01e2eef in lfs_vunref ()
#4  0xf01e0dee in lfs_writevnodes ()
#5  0xf01e0f40 in lfs_segwrite ()
#6  0xf01e5594 in lfs_sync ()
#7  0xf0160444 in sys_sync ()
#8  0xf015f6da in vfs_shutdown ()
#9  0xf020080b in cpu_reboot ()
#10 0xf0146779 in panic ()
#11 0xf013ac22 in lockmgr ()
#12 0xf01de134 in lfs_vfree ()
#13 0xf01e7e71 in ufs_inactive ()
#14 0xf015e3f0 in vrele ()
#15 0xf016abea in union_freevp ()
#16 0xf016dcfe in union_reclaim ()
#17 0xf015e6ad in vclean ()
#18 0xf015e823 in vgonel ()
#19 0xf015d981 in getnewvnode ()
#20 0xf01dcca4 in ffs_vget ()
#21 0xf01e8c7e in ufs_lookup ()
#22 0xf016c131 in union_lookup1 ()
#23 0xf016c44f in union_lookup ()
#24 0xf015d004 in lookup ()
#25 0xf015ccf3 in namei ()
#26 0xf0161db8 in sys___stat13 ()
#27 0xf02079aa in syscall ()
#28 0xf0100e09 in syscall1 ()
can not access 0xefbfd518, invalid translation (invalid PDE)
can not access 0xefbfd518, invalid translation (invalid PDE)
Cannot access memory at address 0xefbfd518.
(gdb) quit
wheel { /var/crash } # 

   At the time of the crash I was doing a build, and it was somewhere
after building includes in gnu. The build is made on a union fs,
with the source in FFS mounted under an LFS. I'll try it again to get
a better fix on where the crash occurs.

					Gary D. Duzan