NetBSD 9.2 (STABLE) kernel hangs without panic or ddb
My main server runs NetBSD 9.2 (STABLE). It has 16 GB of RAM, an i7 CPU
(i7-4770) and a lot of disks: wd0/wd1 configured as ccd [swap for a
diskless workstation], wd2/wd3 as RAID level 1 [system], wd4/wd5/wd6 as
RAID level 5 [home], plus an external NAS [iSCSI target, bacula
archives]. It runs a custom kernel because I have configured ALTQ. I
have noticed that I cannot stop altqd or reload its configuration
without trouble: whenever altqd stops or restarts, it takes 100% of a
CPU...
The Ethernet link used to reach the NAS is configured on both sides with
an MTU of 9000 (wm0):
wm0: flags=0x8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 9000
        media: Ethernet autoselect (1000baseT full-duplex)
        inet 192.168.12.1/24 broadcast 192.168.12.255 flags 0x0
        inet6 fe80::b696:91ff:fe92:776e%wm0/64 flags 0x0 scopeid 0x1
I don't know when I first saw this issue, but as far as I remember, 9.0
ran fine, and maybe 9.1 as well. With 9.2, when bacula starts its
monthly archive (around 50 files of 50 GB each, on an ffs2 filesystem),
the kernel can hang at random. Since yesterday, I have seen two hangs.
The system no longer responds, the kernel does not enter ddb even though
ddb is enabled through sysctl, and the magic request does nothing. The
kernel does not panic. I have tried the serial console: no answer. There
is no dump file. The system simply stops, maybe on a mutex or a spinlock.
If I unmount the iSCSI target, the system seems to be stable. I don't
know whether this issue is related to the iSCSI initiator itself or to
an interaction between iSCSI and, maybe, the ccd driver... And I cannot
remove functions from this server to test.
I have to fix this issue, but how? I don't have any usable