NetBSD-Bugs archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
kern/58043: kernel crash in -current
>Number: 58043
>Category: kern
>Synopsis: kernel crash in assert_sleepable() in -current, dk(4) driver?
>Confidential: no
>Severity: serious
>Priority: high
>Responsible: kern-bug-people
>State: open
>Class: sw-bug
>Submitter-Id: net
>Arrival-Date: Sat Mar 16 15:05:00 +0000 2024
>Originator: Paul Goyette
>Release: NetBSD 10.99.10
>Organization:
+---------------------+--------------------------+----------------------+
| Paul Goyette (.sig) | PGP Key fingerprint: | E-mail addresses: |
| (Retired) | 1B11 1849 721C 56C8 F63A | paul%whooppee.com@localhost |
| Software Developer | 6E2E 05FD 15CE 9F2D 5102 | pgoyette%netbsd.org@localhost |
| & Network Engineer | | pgoyette99%gmail.com@localhost |
+---------------------+--------------------------+----------------------+
>Environment:
System: NetBSD speedy.whooppee.com 10.99.10 NetBSD 10.99.10 (SPEEDY 2024-03-13 18:25:47 UTC) #0: Wed Mar 13 20:05:25 UTC 2024 paul%speedy.whooppee.com@localhost:/build/netbsd-local/obj/amd64/sys/arch/amd64/compile/SPEEDY amd64
Architecture: x86_64
Machine: amd64
>Description:
At unpredictable times, but always under heavy disk load (ie,
build.sh runnning with -j30) I am seeing random crashes. I
have a crash dump from one of these crashes, and stack trace
seems to implicate the disk driver:
Crash version 10.99.10, image version 10.99.10.
crash: _kvm_kvatop(0)
Kernel compiled without options LOCKDEBUG.
System panicked: dump forced via kernel debugger
Backtrace from time of crash is available.
crash> bt
end() at 0
kern_reboot() at kern_reboot+0x87
db_sync_cmd() at db_sifting_cmd
db_command() at db_command+0x123
db_command_loop() at db_command_loop+0x1c7
db_trap() at db_trap+0xcc
kdb_trap() at kdb_trap+0x106
trap() at trap+0x2de
--- trap (number 1) ---
breakpoint() at breakpoint+0x5
vpanic() at vpanic+0x173
panic() at printf_nostamp
assert_sleepable() at assert_sleepable+0x99
pool_cache_get_paddr() at pool_cache_get_paddr+0x13c
end() at ffffffff813ad275
bdev_strategy() at bdev_strategy+0x81
spec_strategy() at spec_strategy+0x6e
VOP_STRATEGY() at VOP_STRATEGY+0x3c
dkstart() at dkstart+0x13e
dkiodone() at dkiodone+0xa6
lddone() at lddone+0x10
nvme_q_complete() at nvme_q_complete+0xff
softint_dispatch() at softint_dispatch+0x112
DDB lost frame for Xsoftintr+0x4c, trying 0xffffd220dfd9d0f0
Xsoftintr() at Xsoftintr+0x4c
--- interrupt ---
0:
I've had several other similar crashes, although I haven't
saved dump details. All stack traces seem to have pointed
in the same area, and all fail at the assert_sleepable().
Config and/or dmesg are available. One item of note is that
this machine contains multiple SSDs, and in one case I have
a ccd(4) of two 2-TB CCD partitions (each of which occupies
a complete SSD device).
>How-To-Repeat:
No specific recipe to reproduce, it is seeming random when
under high disk activity.
>Fix:
Please. In fact, pretty-please.
>Unformatted:
Home |
Main Index |
Thread Index |
Old Index