Subject: kern/10441: softdeps panics (again)
To: None <gnats-bugs@gnats.netbsd.org>
From: Martin Husemann <martin@rumolt.teuto.de>
List: netbsd-bugs
Date: 06/25/2000 11:37:12
>Number:         10441
>Category:       kern
>Synopsis:       new softdep code under BSD license panics again
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sun Jun 25 11:38:00 PDT 2000
>Closed-Date:
>Last-Modified:
>Originator:     Martin Husemann
>Release:        1.5A cvs updated a few hours ago
>Organization:
>Environment:
	
System: NetBSD rumolt.teuto.de 1.5A NetBSD 1.5A (RUMOLT) #0: Sun Jun 25 16:52:42 MEST 2000 martin@rumolt.teuto.de:/usr/src/sys-i4b/arch/i386/compile/RUMOLT i386


>Description:

After switching to a new kernel with the ffs_softdep.c not in gnu/sys anymore,
I had a panic when doing a "make cleandir" in /usr/src.

Backtrace from the saved dump is:

(gdb) target kcore netsd.11.core
panic: softdep_disk_write_complete: lock is held
#0  0xc026e980 in vga_accessops ()
(gdb) bt
#0  0xc026e980 in vga_accessops ()
#1  0xee26000 in ?? ()
#2  0xc01f4e2b in cpu_reboot ()
#3  0xc01378f0 in log ()
#4  0xc0128bd6 in lockmgr ()
#5  0xc015b6fc in genfs_lock ()
#6  0xc015a587 in vn_lock ()
#7  0xc0153062 in vget ()
#8  0xc01d8d01 in ffs_sync ()
#9  0xc0158f81 in sys_sync ()
#10 0xc0153ef3 in vfs_shutdown ()
#11 0xc01f4e03 in cpu_reboot ()
#12 0xc01378f0 in log ()
#13 0xc01d3474 in softdep_disk_write_complete ()
#14 0xc014e7ef in biodone ()
#15 0xc0201b6e in wddone ()
#16 0xc0204967 in wdc_ata_bio_done ()
#17 0xc02041f4 in wdc_ata_bio_intr ()
#18 0xc0109919 in wdcintr ()
#19 0xc021044a in pciide_compat_intr ()
#20 0xc01018c4 in Xintr14 ()
#21 0xc01cfac2 in softdep_setup_allocindir_page ()
#22 0xc01ccaac in ffs_balloc ()
#23 0xc01d9f14 in ffs_write ()
#24 0xc015a1f2 in vn_write ()
#25 0xc013b4b3 in dofilewrite ()
#26 0xc013b716 in sys_write ()
#27 0xc01f8e2c in syscall ()
#28 0xc0100d49 in syscall1 ()
can not access 0xbfbfd898, invalid translation (invalid PDE)
can not access 0xbfbfd898, invalid translation (invalid PDE)
Cannot access memory at address 0xbfbfd898.

I'm not sure about the quality of this dump: while savecore wrote it
another panic happend: "lockmgr: locking against myself", and it dumped.
So maybe this is from the second panic, but had the first panic message
preserved?

>How-To-Repeat:
Sorry, don't know.

>Fix:
n/a
>Release-Note:
>Audit-Trail:
>Unformatted: