Current-Users archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: wm0 panic



Hello, all.

On 2020/07/06 17:03, Masanobu SAITOH wrote:
Hi, all.

On 2020/06/29 12:53, Kengo NAKAHARA wrote:
Hi,

On 2020/06/28 0:24, Patrick Welche wrote:
Trying a today's -current/amd64 with DIAGNOSTIC/DEBUG/LOCKDEBUG, I can
boot multiuser without a network. If I log in as root, as soon as I hit
enter:

# ifconfig wm0 inet 10.0.0.62 netmask 0xffffff00
[ 127.5763268] Kernel lock error 127.5763268] lock address : 0xffffffff8106ab40 type     :               spin
[ 127.5863237] initialized  : 0xffffffff80b0bbb9
[ 127.5863237] shared holds :                  0 exclusive:                  1
[ 127.5963238] shares wanted:                  0 exclusive:                  1
[ 127.6063236] relevant cpu :                  1 last held:                  0
[ 127.6163235] relevant lwp : 0xffff8d419a07f20
[ 127.6163235] last locked* : 0xffffffff80a7d2f5 unlocked : 0xffffffff80a7d2e6
[ 127.6263235] curcpu holds :                  0 wanted by: 0xffff8d419a07f200
[ 127.6363234] panic: LOCKDEBock,244: spinout
[ 127.6363234] cpu1: Begin traceback...
[ 127.6463233] vpanic() at netbsd:vpanic+0x152
[ 127.6463233] snprintf() at netbsd:snprintf
[ 127.6563232] lockdebug_more() at netbsd:lockdebug_more
[ 127.6563232] _kernel_lock() at netbsd:_kernel_lock+0x244
[ 127.6663231] ip_slowtimo() at netbsd:ip_slowtimo+0x1a
[ 127.6763231] pfslowtimo() at netbsd:pfslowtimo+0x34
[ 127.6763231] callout_softclock() at netbsd:callout_softclock+0x10f
[ 127.6863230] softint_disph+0x108
[ 127.6863230] DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffffa4825d02eff0
[ 127.6963230] Xsoftintr() at netbsd:Xsoftintr+0x4f
[ 127.7063229] --- interrupt ---
[ 127.706322traceback...
It seems some other code have held KERNEL_LOCK too long time.
Could you show the function of last locked address?
# e.g. addr2line -e "your kernel image" -f 0xffffffff80a7d2f5

If the panic can reappear, could you show "show all locks/t" of ddb?


Thanks,

It seems this problem is the same as the following mail:

	http://mail-index.netbsd.org/current-users/2020/06/03/msg038785.html


Me, and Patrick tried to find the root cause of this problem off-list.
(Note that I can't reproduce the same problem on my machines)

Every time Patrick got this problem, ifconfig had tried to take
module_hook.mtx which was initialized in module_hook_init().
It seems ddb can't trace the stack of ifconfig. Was the process
in a module? Is there a way to know what's happening in it?


------------------
[  22.3977608] Kernel lock error: _kernel_lock,244: spinout

[  22.4077593] lock address : 0xffffffff818a9600 type     :               spin
[  22.4177593] initialized  : 0xffffffff80e7e7e9
[  22.4277592] shared holds :                  0 exclusive:                  1
[  22.4477593] shares wanted:                  0 exclusive:                  2
[  22.4577592] relevant cpu :                  6 last held:                  0
[  22.4677593] relevant lwp : 0xffff850788f52140 last held: 0xffff85078a413b40
[  22.4877595] last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
[  22.4977596] curcpu holds :                  0 wanted by: 0xffff850788f52140

[  22.5077596] panic: LOCKDEBUG: Kernel lock error: _kernel_lock,244: spinout
[  22.5277594] cpu6: Begin traceback...
[  22.5377595] vpanic() at netbsd:vpanic+0x152
[  22.5377595] snprintf() at netbsd:snprintf
[  22.5477594] lockdebug_more() at netbsd:lockdebug_more
[  22.5577621] _kernel_lock() at netbsd:_kernel_lock+0x244
[  22.5777595] frag6_fasttimo() at netbsd:frag6_fasttimo+0x1a
[  22.5877596] pffasttimo() at netbsd:pffasttimo+0x34
[  22.5977596] callout_softclock() at netbsd:callout_softclock+0x10f
[  22.6077597] softint_dispatch() at netbsd:softint_dispatch+0x108
[  22.6177598] DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffff96025d0d1ff0
[  22.6377596] Xsoftintr() at netbsd:Xsoftintr+0x4f
[  22.6377596] --- interrupt ---
[  22.6477596] d787ba705305aa3:
[  22.6577597] cpu6: End traceback...
[  22.6577597] fatal breakpoint trap in supervisor mode
[  22.6677597] trap type 1 code 0 rip 0xffffffff8022098d cs 0x8 rflags 0x202 cr2 0 ilevel 0x2 rsp 0xffff96025d0d1d70
[  22.6877599] curlwp 0xffff850788f52140 pid 0.118 lowest kstack 0xffff96025d0cd2c0
Stopped in pid 0.118 (system) at        netbsd:breakpoint+0x5:  leave
breakpoint() at netbsd:breakpoint+0x5
vpanic() at netbsd:vpanic+0x152
snprintf() at netbsd:snprintf
lockdebug_more() at netbsd:lockdebug_more
_kernel_lock() at netbsd:_kernel_lock+0x244
frag6_fasttimo() at netbsd:frag6_fasttimo+0x1a
pffasttimo() at netbsd:pffasttimo+0x34
callout_softclock() at netbsd:callout_softclock+0x10f
softint_dispatch() at netbsd:softint_dispatch+0x108
DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffff96025d0d1ff0
Xsoftintr() at netbsd:Xsoftintr+0x4f
--- interrupt ---
d787ba705305aa3:
ds          1d80
es          1d30
fs          1d70
gs          10
rdi         2
rsi         2d5
rbp         ffff96025d0d1d70
rbx         104
rdx         1
rcx         2
rax         0
r8          104
r9          0
r10         ffff96025d0d1cf0
r11         10
r12         ffffffff81467b48
r13         ffff96025d0d1db8
r14         ffffffff8145e9ce
r15         1
rip         ffffffff8022098d    breakpoint+0x5
cs          8
rflags      202
rsp         ffff96025d0d1d70
ss          10
netbsd:breakpoint+0x5:  leave
db{6}> set $lines=0
$lines          18 = 0
db{6}> show all locks/t
[Locks tracked through LWPs]

****** LWP 507.507 (ifconfig) @ 0xffff85078a413b40, l_stat=7

*** Locks held:

* Lock 0 (initialized at if_initialize)
lock address : 0xffff850789377400 type     :     sleep/adaptive
initialized  : 0xffffffff80db351d
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  0
relevant cpu :                  0 last held:                  0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db808e unlocked : 0xffffffff80db8165
owner field  : 0xffff85078a413b40 wait/spin:                0/0
Turnstile: no active turnstile for this lock.

* Lock 1 (initialized at in6_init)
lock address : 0xffffffff818fa890 type     :     sleep/adaptive
initialized  : 0xffffffff80a1715b
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  0
relevant cpu :                  0 last held:                  0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80a17e76 unlocked : 000000000000000000
owner field  : 0xffff85078a413b40 wait/spin:                0/0
Turnstile: no active turnstile for this lock.

*** Locks wanted:

* Lock 0 (initialized at module_hook_init)
lock address : 0xffffffff818a9700 type     :     sleep/adaptive
initialized  : 0xffffffff80cb8dae
shared holds :                  0 exclusive:                  0
shares wanted:                  0 exclusive:                  0
relevant cpu :                  0 last held:                  0
relevant lwp : 0xffff85078a413b40 last held: 000000000000000000
last locked  : 000000000000000000 unlocked*: 000000000000000000
owner field  : 000000000000000000 wait/spin:                0/0
Turnstile: no active turnstile for this lock.

*** Traceback:

trace: pid 507 lid 507 at 0xffff96026e1d4990
lockdebug_unlocked() at netbsd:lockdebug_unlocked+0x10b
comcnputc() at netbsd:comcnputc+0xaa
comcnputc() at netbsd:comcnputc+0xaa


****** LWP 0.401 (iic0) @ 0xffff850789981600, l_stat=7

*** Locks held: none

*** Locks wanted:

* Lock 0 (initialized at main)
lock address : 0xffffffff818a9600 type     :               spin
initialized  : 0xffffffff80e7e7e9
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  2
relevant cpu :                  3 last held:                  0
relevant lwp : 0xffff850789981600 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
curcpu holds :                  0 wanted by: 0xffff850788f52140

*** Traceback:

trace: pid 0 lid 401 at 0xffff96025da38ec0
sleepq_block() at netbsd:sleepq_block+0x211
iic_smbus_intr_thread() at netbsd:iic_smbus_intr_thread+0x52


****** LWP 0.118 (softclk/6) @ 0xffff850788f52140, l_stat=7

*** Locks held:

* Lock 0 (initialized at soinit)
lock address : 0xffff850e68deb080 type     :     sleep/adaptive
initialized  : 0xffffffff80d42fc1
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  0
relevant cpu :                  6 last held:                  6
relevant lwp : 0xffff850788f52140 last held: 0xffff850788f52140
last locked* : 0xffffffff80a106bf unlocked : 0xffffffff80d43a05
owner field  : 0xffff850788f52140 wait/spin:                0/0
Turnstile: no active turnstile for this lock.

*** Locks wanted:

* Lock 0 (initialized at main)
lock address : 0xffffffff818a9600 type     :               spin
initialized  : 0xffffffff80e7e7e9
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  2
relevant cpu :                  6 last held:                  0
relevant lwp : 0xffff850788f52140 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
curcpu holds :                  0 wanted by: 0xffff850788f52140

*** Traceback:

trace: pid 0 lid 118 at 0xffff96025d0d1d70
breakpoint() at netbsd:breakpoint+0x5
vpanic() at netbsd:vpanic+0x152
snprintf() at netbsd:snprintf
lockdebug_more() at netbsd:lockdebug_more
_kernel_lock() at netbsd:_kernel_lock+0x244
frag6_fasttimo() at netbsd:frag6_fasttimo+0x1a
pffasttimo() at netbsd:pffasttimo+0x34
callout_softclock() at netbsd:callout_softclock+0x10f
softint_dispatch() at netbsd:softint_dispatch+0x108
DDB lost frame for netbsd:Xsoftintr+0x4f, trying 0xffff96025d0d1ff0
Xsoftintr() at netbsd:Xsoftintr+0x4f
--- interrupt ---
d787ba705305aa3:


[Locks tracked through CPUs]

******* Locks held on cpu0:

* Lock 0 (initialized at main)
lock address : 0xffffffff818a9600 type     :               spin
initialized  : 0xffffffff80e7e7e9
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  2
relevant cpu :                  0 last held:                  0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80db867b unlocked : 0xffffffff80db866c
curcpu holds :                  0 wanted by: 0xffff850788f52140
trace: pid 507 lid 507 at 0xffff96026e1d4990
lockdebug_unlocked() at netbsd:lockdebug_unlocked+0x10b
comcnputc() at netbsd:comcnputc+0xaa
comcnputc() at netbsd:comcnputc+0xaa

* Lock 1 (initialized at ifmedia_init_with_lock)
lock address : 0xffff850789377380 type     :               spin
initialized  : 0xffffffff80dd14c2
shared holds :                  0 exclusive:                  1
shares wanted:                  0 exclusive:                  0
relevant cpu :                  0 last held:                  0
relevant lwp : 0xffff85078a413b40 last held: 0xffff85078a413b40
last locked* : 0xffffffff80dd124f unlocked : 0xffffffff803325fb
owner field  : 0x0000000000010600 wait/spin:                0/1
trace: pid 507 lid 507 at 0xffff96026e1d4990
lockdebug_unlocked() at netbsd:lockdebug_unlocked+0x10b
comcnputc() at netbsd:comcnputc+0xaa
comcnputc() at netbsd:comcnputc+0xaa

------------------

--
-----------------------------------------------
                SAITOH Masanobu (msaitoh%execsw.org@localhost
                                 msaitoh%netbsd.org@localhost)


Home | Main Index | Thread Index | Old Index