tech-kern archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: [10.99.12] Panic (softints stuck)



> Date: Sun, 16 Feb 2025 16:24:20 +0100
> From: BERTRAND Joël <joel.bertrand%systella.fr@localhost>
> 
> 	System runs a kernel with sys/arch/amd64/amd64/machdep.c
> rev. 1.370. Panic occurs this morning and.. no crash dump in /var/crash...

What does /var/run/rc.log say about savecore?

Please test crash dumps in your configuration before you spend weeks
waiting for the symptom to randomly manifest again!  Test a similar
configuration, say in a VM, if you absolutely cannot take this machine
down for a test at a predictable time -- otherwise you'll continue
having to take it down at unpredictable times anyway.

> Feb 16 14:57:49 legendre /netbsd: [ 2118436.0016171] dumping to dev 18,1
> (offset=253015, size=4162677):

Do you log the serial console output?  I would be curious to see if
anything was printed after that.

The last time around, in PR 59024, you shared these two lines of
output:

> [ 417509.006761] dumping to dev 18,1 (offset=253015, size=4162677):
> [ 417509.006761] dump device bad

The second line of output is missing in what you just quoted this
time, which is curious.  I also added a lot of diagnostics in rev.
1.371 that should appear between those two lines and perhaps shed some
light on why the dumps are failing (tracked in PR kern/59024 `dump
fails on raid0b' <https://gnats.NetBSD.org/59024>).  If you're running
with 1.370 then you're already on current, so I strongly recommend you
update to current with 1.371 _and test crash dumps_.


Home | Main Index | Thread Index | Old Index