Current-Users archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: wm0 panic
Hello, all.
On 2020/07/06 17:03, Masanobu SAITOH wrote:
Hi, all.
On 2020/06/29 12:53, Kengo NAKAHARA wrote:
Hi,
On 2020/06/28 0:24, Patrick Welche wrote:
Trying a today's -current/amd64 with DIAGNOSTIC/DEBUG/LOCKDEBUG, I can
boot multiuser without a network. If I log in as root, as soon as I hit
enter:
# ifconfig wm0 inet 10.0.0.62 netmask 0xffffff00
[ 127.5763268] Kernel lock error 127.5763268] lock address : 0xffffffff8106ab40 type : spin
[ 127.5863237] initialized : 0xffffffff80b0bbb9
[ 127.5863237] shared holds : 0 exclusive: 1
[ 127.5963238] shares wanted: 0 exclusive: 1
[ 127.6063236] relevant cpu : 1 last held: 0
[ 127.6163235] relevant lwp : 0xffff8d419a07f20
[ 127.6163235] last locked* : 0xffffffff80a7d2f5 unlocked : 0xffffffff80a7d2e6
[ 127.6263235] curcpu holds : 0 wanted by: 0xffff8d419a07f200
[ 127.6363234] panic: LOCKDEBock,244: spinout
[ 127.6363234] cpu1: Begin traceback...
[ 127.6463233] vpanic() at netbsd:vpanic+0x152
[ 127.6463233] snprintf() at netbsd:snprintf
[ 127.6563232] lockdebug_more() at netbsd:lockdebug_more
[ 127.6563232] _kernel_lock() at netbsd:_kernel_lock+0x244
[ 127.6663231] ip_slowtimo() at netbsd:ip_slowtimo+0x1a
[ 127.6763231] pfslowtimo() at netbsd:pfslowtimo+0x34
[ 127.6763231] callout_softclock() at netbsd:callout_softclock+0x10f
[ 127.6863230] softint_disph+0x108
[ 127.6863230] DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffffa4825d02eff0
[ 127.6963230] Xsoftintr() at netbsd:Xsoftintr+0x4f
[ 127.7063229] --- interrupt ---
[ 127.706322traceback...
It seems some other code have held KERNEL_LOCK too long time.
Could you show the function of last locked address?
# e.g. addr2line -e "your kernel image" -f 0xffffffff80a7d2f5
If the panic can reappear, could you show "show all locks/t" of ddb?
Thanks,
It seems this problem is the same as the following mail:
http://mail-index.netbsd.org/current-users/2020/06/03/msg038785.html
Me, and Patrick tried to find the root cause of this problem off-list.
(Note that I can't reproduce the same problem on my machines)
Every time Patrick got this problem, ifconfig had tried to take
module_hook.mtx which was initialized in module_hook_init().
It seems ddb can't trace the stack of ifconfig. Was the process
in a module? Is there a way to know what's happening in it?
------------------
[ 22.3977608] Kernel lock error: _kernel_lock,244: spinout
[ 22.4077593] lock address : 0xffffffff818a9600 type : spin
[ 22.4177593] initialized : 0xffffffff80e7e7e9
[ 22.4277592] shared holds : 0 exclusive: 1
[ 22.4477593] shares wanted: 0 exclusive: 2
[ 22.4577592] relevant cpu : 6 last held: 0
[ 22.4677593] relevant lwp : 0xffff850788f52140 last held: 0xffff85078a413b40
[ 22.4877595] last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
[ 22.4977596] curcpu holds : 0 wanted by: 0xffff850788f52140
[ 22.5077596] panic: LOCKDEBUG: Kernel lock error: _kernel_lock,244: spinout
[ 22.5277594] cpu6: Begin traceback...
[ 22.5377595] vpanic() at netbsd:vpanic+0x152
[ 22.5377595] snprintf() at netbsd:snprintf
[ 22.5477594] lockdebug_more() at netbsd:lockdebug_more
[ 22.5577621] _kernel_lock() at netbsd:_kernel_lock+0x244
[ 22.5777595] frag6_fasttimo() at netbsd:frag6_fasttimo+0x1a
[ 22.5877596] pffasttimo() at netbsd:pffasttimo+0x34
[ 22.5977596] callout_softclock() at netbsd:callout_softclock+0x10f
[ 22.6077597] softint_dispatch() at netbsd:softint_dispatch+0x108
[ 22.6177598] DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffff96025d0d1ff0
[ 22.6377596] Xsoftintr() at netbsd:Xsoftintr+0x4f
[ 22.6377596] --- interrupt ---
[ 22.6477596] d787ba705305aa3:
[ 22.6577597] cpu6: End traceback...
[ 22.6577597] fatal breakpoint trap in supervisor mode
[ 22.6677597] trap type 1 code 0 rip 0xffffffff8022098d cs 0x8 rflags 0x202 cr2 0 ilevel 0x2 rsp 0xffff96025d0d1d70
[ 22.6877599] curlwp 0xffff850788f52140 pid 0.118 lowest kstack 0xffff96025d0cd2c0
Stopped in pid 0.118 (system) at netbsd:breakpoint+0x5: leave
breakpoint() at netbsd:breakpoint+0x5
vpanic() at netbsd:vpanic+0x152
snprintf() at netbsd:snprintf
lockdebug_more() at netbsd:lockdebug_more
_kernel_lock() at netbsd:_kernel_lock+0x244
frag6_fasttimo() at netbsd:frag6_fasttimo+0x1a
pffasttimo() at netbsd:pffasttimo+0x34
callout_softclock() at netbsd:callout_softclock+0x10f
softint_dispatch() at netbsd:softint_dispatch+0x108
DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffff96025d0d1ff0
Xsoftintr() at netbsd:Xsoftintr+0x4f
--- interrupt ---
d787ba705305aa3:
ds 1d80
es 1d30
fs 1d70
gs 10
rdi 2
rsi 2d5
rbp ffff96025d0d1d70
rbx 104
rdx 1
rcx 2
rax 0
r8 104
r9 0
r10 ffff96025d0d1cf0
r11 10
r12 ffffffff81467b48
r13 ffff96025d0d1db8
r14 ffffffff8145e9ce
r15 1
rip ffffffff8022098d breakpoint+0x5
cs 8
rflags 202
rsp ffff96025d0d1d70
ss 10
netbsd:breakpoint+0x5: leave
db{6}> set $lines=0
$lines 18 = 0
db{6}> show all locks/t
[Locks tracked through LWPs]
****** LWP 507.507 (ifconfig) @ 0xffff85078a413b40, l_stat=7
*** Locks held:
* Lock 0 (initialized at if_initialize)
lock address : 0xffff850789377400 type : sleep/adaptive
initialized : 0xffffffff80db351d
shared holds : 0 exclusive: 1
shares wanted: 0 exclusive: 0
relevant cpu : 0 last held: 0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db808e unlocked : 0xffffffff80db8165
owner field : 0xffff85078a413b40 wait/spin: 0/0
Turnstile: no active turnstile for this lock.
* Lock 1 (initialized at in6_init)
lock address : 0xffffffff818fa890 type : sleep/adaptive
initialized : 0xffffffff80a1715b
shared holds : 0 exclusive: 1
shares wanted: 0 exclusive: 0
relevant cpu : 0 last held: 0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80a17e76 unlocked : 000000000000000000
owner field : 0xffff85078a413b40 wait/spin: 0/0
Turnstile: no active turnstile for this lock.
*** Locks wanted:
* Lock 0 (initialized at module_hook_init)
lock address : 0xffffffff818a9700 type : sleep/adaptive
initialized : 0xffffffff80cb8dae
shared holds : 0 exclusive: 0
shares wanted: 0 exclusive: 0
relevant cpu : 0 last held: 0
relevant lwp : 0xffff85078a413b40 last held: 000000000000000000
last locked : 000000000000000000 unlocked*: 000000000000000000
owner field : 000000000000000000 wait/spin: 0/0
Turnstile: no active turnstile for this lock.
*** Traceback:
trace: pid 507 lid 507 at 0xffff96026e1d4990
lockdebug_unlocked() at netbsd:lockdebug_unlocked+0x10b
comcnputc() at netbsd:comcnputc+0xaa
comcnputc() at netbsd:comcnputc+0xaa
****** LWP 0.401 (iic0) @ 0xffff850789981600, l_stat=7
*** Locks held: none
*** Locks wanted:
* Lock 0 (initialized at main)
lock address : 0xffffffff818a9600 type : spin
initialized : 0xffffffff80e7e7e9
shared holds : 0 exclusive: 1
shares wanted: 0 exclusive: 2
relevant cpu : 3 last held: 0
relevant lwp : 0xffff850789981600 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
curcpu holds : 0 wanted by: 0xffff850788f52140
*** Traceback:
trace: pid 0 lid 401 at 0xffff96025da38ec0
sleepq_block() at netbsd:sleepq_block+0x211
iic_smbus_intr_thread() at netbsd:iic_smbus_intr_thread+0x52
****** LWP 0.118 (softclk/6) @ 0xffff850788f52140, l_stat=7
*** Locks held:
* Lock 0 (initialized at soinit)
lock address : 0xffff850e68deb080 type : sleep/adaptive
initialized : 0xffffffff80d42fc1
shared holds : 0 exclusive: 1
shares wanted: 0 exclusive: 0
relevant cpu : 6 last held: 6
relevant lwp : 0xffff850788f52140 last held: 0xffff850788f52140
last locked* : 0xffffffff80a106bf unlocked : 0xffffffff80d43a05
owner field : 0xffff850788f52140 wait/spin: 0/0
Turnstile: no active turnstile for this lock.
*** Locks wanted:
* Lock 0 (initialized at main)
lock address : 0xffffffff818a9600 type : spin
initialized : 0xffffffff80e7e7e9
shared holds : 0 exclusive: 1
shares wanted: 0 exclusive: 2
relevant cpu : 6 last held: 0
relevant lwp : 0xffff850788f52140 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
curcpu holds : 0 wanted by: 0xffff850788f52140
*** Traceback:
trace: pid 0 lid 118 at 0xffff96025d0d1d70
breakpoint() at netbsd:breakpoint+0x5
vpanic() at netbsd:vpanic+0x152
snprintf() at netbsd:snprintf
lockdebug_more() at netbsd:lockdebug_more
_kernel_lock() at netbsd:_kernel_lock+0x244
frag6_fasttimo() at netbsd:frag6_fasttimo+0x1a
pffasttimo() at netbsd:pffasttimo+0x34
callout_softclock() at netbsd:callout_softclock+0x10f
softint_dispatch() at netbsd:softint_dispatch+0x108
DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffff96025d0d1ff0
Xsoftintr() at netbsd:Xsoftintr+0x4f
--- interrupt ---
d787ba705305aa3:
[Locks tracked through CPUs]
******* Locks held on cpu0:
* Lock 0 (initialized at main)
lock address : 0xffffffff818a9600 type : spin
initialized : 0xffffffff80e7e7e9
shared holds : 0 exclusive: 1
shares wanted: 0 exclusive: 2
relevant cpu : 0 last held: 0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
curcpu holds : 0 wanted by: 0xffff850788f52140
trace: pid 507 lid 507 at 0xffff96026e1d4990
lockdebug_unlocked() at netbsd:lockdebug_unlocked+0x10b
comcnputc() at netbsd:comcnputc+0xaa
comcnputc() at netbsd:comcnputc+0xaa
* Lock 1 (initialized at ifmedia_init_with_lock)
lock address : 0xffff850789377380 type : spin
initialized : 0xffffffff80dd14c2
shared holds : 0 exclusive: 1
shares wanted: 0 exclusive: 0
relevant cpu : 0 last held: 0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80dd124f unlocked : 0xffffffff803325fb
owner field : 0x0000000000010600 wait/spin: 0/1
trace: pid 507 lid 507 at 0xffff96026e1d4990
lockdebug_unlocked() at netbsd:lockdebug_unlocked+0x10b
comcnputc() at netbsd:comcnputc+0xaa
comcnputc() at netbsd:comcnputc+0xaa
------------------
--
-----------------------------------------------
SAITOH Masanobu (msaitoh%execsw.org@localhost
msaitoh%netbsd.org@localhost)
Home |
Main Index |
Thread Index |
Old Index