NetBSD-Bugs archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: kern/53940: WM0 device timeout issue in NetBSD 7.1
The following reply was made to PR kern/53940; it has been noted by GNATS.
From: =?UTF-8?B?SmFyb23DrXIgRG9sZcSNZWs=?= <jaromir.dolecek%gmail.com@localhost>
To: "gnats-bugs%NetBSD.org@localhost" <gnats-bugs%netbsd.org@localhost>
Cc:
Subject: Re: kern/53940: WM0 device timeout issue in NetBSD 7.1
Date: Thu, 14 Feb 2019 20:27:50 +0100
Can you please test with later kernel? There were some changes in 8.0
which might not be in 7.1, and many further changes in -current.
You can download -current kernel from the daily builds and boot it
with your existing userland.
Jaromir
Le lun. 4 f=C3=A9vr. 2019 =C3=A0 06:54, <aravind_m1%dell.com@localhost> a =C3=A9crit =
:
>
> >Number: 53940
> >Category: kern
> >Synopsis: WM0 device timeout issue in NetBSD 7.1
> >Confidential: no
> >Severity: serious
> >Priority: high
> >Responsible: kern-bug-people
> >State: open
> >Class: sw-bug
> >Submitter-Id: net
> >Arrival-Date: Mon Feb 04 05:30:00 +0000 2019
> >Originator: Aravind Mani
> >Release: NetBSD 7.1
> >Organization:
> Dell
> >Environment:
> WM0 timeout issue in NetBSD 7.1
> >Description:
> We use WM_T_I354 chip type.When we reload continuously,we could able to o=
bserve device timeout issue. wm_init(),wm_reset() doesn't help to recover f=
rom problem state.The only way to recover is to reload the switch.There was=
no initialization error.
> >From wm_print_stats() and wm_pkt_stats(),i don't see any value in the re=
gisters listed and the packets are not hitting the hardware.
> wm_reset also didn't help to recover the issue.
> We didn't remove wm_print_stats and wm_pkt_stats in NetBSD 7.1
>
> I've taken the kernel live core last time when we faced this issue and i =
could see that the WM PHY was active.
> WM0 device timeout keeps on piling after wm_reset.
> The management port(WM0)is UP but can't able to ping external network.
> Please provide your view on this issue and do you need any other logs to =
investigate?.
>
>
> logs:
> wm0: device timeout (txfree 4095 txsfree 63 txnext 141)
>
> SStk-1 # vmstat -e
> event total rate type
> bus_dma loads 95451577 319 misc
> vmcmd kills 661 0 misc
> vmcmd calls 3731 0 misc
> vmem static_bt_inuse 200 0 misc
> vmem static_bt_count 200 0 misc
> TLB shootdown 182842 0 intr
> cpu0 runqueue pull 16763601 56 misc
> cpu0 runqueue push 218455 0 misc
> cpu0 runqueue stay 29807214 99 misc
> cpu0 runqueue localize 199719304 669 misc
> softint net/0 1172158 3 misc
> softint net block/0 46424 0 misc
> softint bio/0 6245 0 misc
> softint bio block/0 4 0 misc
> softint clk/0 29819349 99 misc
> softint clk block/0 145137 0 misc
> softint ser/0 44794 0 misc
> callout late/0 38366 0 misc
> crosscall unicast 11 0 misc
> crosscall broadcast 4 0 misc
> namecache entries collected 13850 0 misc
> namecache under scan target 298154 0 misc
> cpu0 timer 29826661 99 intr
> cpu0 generic IPI 548755 1 misc
> cpu0 FPU synch IPI 3116 0 misc
> cpu0 kpreempt IPI 235125 0 misc
> cpu1 runqueue pull 18640375 62 misc
> cpu1 runqueue push 2168053 7 misc
> cpu1 runqueue stay 30124219 100 misc
> cpu1 runqueue localize 158923916 532 misc
> softint net/1 365 0 misc
> softint net block/1 360 0 misc
> softint clk/1 29817170 99 misc
> softint clk block/1 28745 0 misc
> softint ser/1 8658 0 misc
> callout late/1 18516 0 misc
> cpu1 timer 29826661 99 misc
> cpu1 FPU synch IPI 4340 0 misc
> cpu1 kpreempt IPI 173706 0 misc
> ioapic0 pin 20 172536 0 intr
> wm0 txsstall 1088 0 misc
> wm0 txdw 183747 0 intr
> wm0 txseg0 255914 0 misc
> ioapic0 pin 23 18 0 intr
> ioapic0 pin 19 6797 0 intr
> ioapic0 pin 4 33936 0 intr
> kpreempt defer: critical section 7776 0 misc
> kpreempt defer: kernel_lock 2793374 9 misc
> kpreempt immediate 493760 1 misc
>
>
> SStk-1 # sysctl -w ddb.command=3D"call wm_pkt_stats(0)"
> Total Pkts Recv =3D0
> Missed Pkts Recv =3D0
> Good Pkts Recv =3D0
> No Buff Pkts Recv =3D0
> Mgmt Pkt Recv =3D0
> Mgmt Buff Drop Recv =3D0
> Interrupt Assertion =3D80
>
> wm_print_stats:
>
> 0x4000 : 0
> 0x4004 : 0
> 0x4008 : 0
> 0x400c : 0
> 0x4010 : 0
> 0x4014 : 0
> 0x4018 : 0
> 0x401c : 0
> 0x4020 : 0
> 0x4024 : 0
> 0x4028 : 0
> 0x402c : 0
> 0x4030 : 0
> 0x4034 : 0
> 0x4038 : 0
> 0x403c : 0
> 0x4040 : 0
> 0x4044 : 0
> 0x4048 : 0
> 0x404c : 0
> 0x4050 : 0
> 0x4054 : 0
> 0x4058 : 0
> 0x405c : 0
> 0x4060 : 0
> 0x4064 : 0
> 0x4068 : 0
> 0x406c : 0
> 0x4070 : 0
> 0x4074 : 0
> 0x4078 : 0
> 0x407c : 0
> 0x4080 : 0
> 0x4084 : 0
> 0x4088 : 0
> 0x408c : 0
> 0x4090 : 0
> 0x4094 : 0
> 0x4098 : 0
> 0x409c : 0
> 0x40a0 : 0
> 0x40a4 : 0
> 0x40a8 : 0
> 0x40ac : 0
> 0x40b0 : 0
> 0x40b4 : 0
> 0x40b8 : 0
> 0x40bc : 0
> 0x40c0 : 0
> 0x40c4 : 0
> 0x40c8 : 0
> 0x40cc : 0
> 0x40d0 : 0
> 0x40d4 : 0
> 0x40d8 : 0
> 0x40dc : 0
> 0x40e0 : 0
> 0x40e4 : 0
> 0x40e8 : 0
> 0x40ec : 0
> 0x40f0 : 0
> 0x40f4 : 0
> 0x40f8 : 0
> 0x40fc : 0
> 0x4100 : 0x24
> 0x4104 : 0
> 0x4108 : 0
> 0x410c : 0
> 0x4110 : 0
> 0x4114 : 0
> 0x4118 : 0
> 0x411c : 0
> 0x4120 : 0
> 0x4124 : 0
> 0x4128 : 0
> 0x412c : 0
> 0x4130 : 0
> 0x4134 : 0
> 0x4138 : 0
> 0x413c : 0
> 0x4140 : 0
> 0x4144 : 0
> 0x4148 : 0
> 0x414c : 0
> 0x4150 : 0
> 0x4154 : 0
>
>
> >How-To-Repeat:
> Reload the switch continuously.
> >Fix:
>
Home |
Main Index |
Thread Index |
Old Index