NetBSD-Bugs archive


Re: kern/55683: ixgX (X550) sometimes stop receiving



On 2020/09/25 22:30, kardel%netbsd.org@localhost wrote:
> 	Note: There was a bad value `' for the field `Confidential'.
> 	It was set to the default value of `yes'.
> 
>> Number:         55683
>> Category:       kern
>> Synopsis:       ixgX (X550) sometimes stop receiving
>> Confidential:   yes
>> Severity:       critical
>> Priority:       high
>> Responsible:    kern-bug-people
>> State:          open
>> Class:          sw-bug
>> Submitter-Id:   net
>> Arrival-Date:   Fri Sep 25 13:30:00 +0000 2020
>> Originator:     Frank Kardel
>> Release:        NetBSD 9.99.72
>> Organization:
> 	
>> Environment:
> 	
> 	
> System: NetBSD gateway 9.99.72 NetBSD 9.99.72 (GATEWAY) #7: Thu Sep 10 13:58:53 UTC 2020 kardel@gateway:/src/NetBSD/cur/src/obj.amd64/sys/arch/amd64/compile/GATEWAY amd64
> Architecture: x86_64
> Machine: amd64
>> Description:
> 	I am observing receive outages on my X550 interfaces. Every once in a while
> 	one of the 4 interfaces becomes deaf. I see very high input error counts on
> 	these interfaces. An ifconfig down/up brings the interfaces back to life.
> 
> State observed when no reception is done:
> 
> # sysctl -a
> net.interfaces.ixg3.rcvq.drops = 0
> net.interfaces.ixg3.sndq.len = 0
> net.interfaces.ixg3.sndq.maxlen = 2046
> net.interfaces.ixg3.sndq.drops = 0
> hw.acpi.wake.ixg3 = 0
> hw.ixg3.rx_processing_limit = 256
> hw.ixg3.tx_processing_limit = 256
> hw.ixg3.debug = 0
> hw.ixg3.num_rx_desc = 2048
> hw.ixg3.num_queues = 4
> hw.ixg3.fc = 0
> hw.ixg3.enable_aim = 1
> hw.ixg3.advertise_speed = 0
> hw.ixg3.txrx_workqueue = 0
> hw.ixg3.dmac = 0
> hw.ixg3.wol_enable = 1
> hw.ixg3.wufc = 0
> hw.ixg3.force_10_100_autonego = 0
> hw.ixg3.eee_state = 1
> hw.ixg3.q0.interrupt_rate = 83333
> hw.ixg3.q0.txd_head = 27
> hw.ixg3.q0.txd_tail = 27
> hw.ixg3.q0.rxd_nxck = 1696
> hw.ixg3.q0.rxd_head = 1696
> hw.ixg3.q0.rxd_tail = 1695
> hw.ixg3.q1.interrupt_rate = 100000
> hw.ixg3.q1.txd_head = 547
> hw.ixg3.q1.txd_tail = 547
> hw.ixg3.q1.rxd_nxck = 1568
> hw.ixg3.q1.rxd_head = 1567
> hw.ixg3.q1.rxd_tail = 1567
> hw.ixg3.q2.interrupt_rate = 35714
> hw.ixg3.q2.txd_head = 870
> hw.ixg3.q2.txd_tail = 870
> hw.ixg3.q2.rxd_nxck = 386
> hw.ixg3.q2.rxd_head = 386
> hw.ixg3.q2.rxd_tail = 385
> hw.ixg3.q3.interrupt_rate = 31250
> hw.ixg3.q3.txd_head = 359
> hw.ixg3.q3.txd_tail = 359
> hw.ixg3.q3.rxd_nxck = 1715
> hw.ixg3.q3.rxd_head = 1715
> hw.ixg3.q3.rxd_tail = 1714
> 
> # ifconfig -vm ixg3
> ixg3: flags=0x8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
> 	capabilities=0x7ff80<TSO4,IP4CSUM_Rx,IP4CSUM_Tx,TCP4CSUM_Rx>
> 	capabilities=0x7ff80<TCP4CSUM_Tx,UDP4CSUM_Rx,UDP4CSUM_Tx,TCP6CSUM_Rx>
> 	capabilities=0x7ff80<TCP6CSUM_Tx,UDP6CSUM_Rx,UDP6CSUM_Tx,TSO6>
> 	enabled=0x7ff80<TSO4,IP4CSUM_Rx,IP4CSUM_Tx,TCP4CSUM_Rx>
> 	enabled=0x7ff80<TCP4CSUM_Tx,UDP4CSUM_Rx,UDP4CSUM_Tx,TCP6CSUM_Rx>
> 	enabled=0x7ff80<TCP6CSUM_Tx,UDP6CSUM_Rx,UDP6CSUM_Tx,TSO6>
> 	ec_capabilities=0xf<VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU,VLAN_HWFILTER>
> 	ec_enabled=0x7<VLAN_MTU,VLAN_HWTAGGING,JUMBO_MTU>
> 	address: X----X
> 	media: Ethernet autoselect (1000baseT full-duplex)
> 	status: active
> 	supported Ethernet media:
> 		media none
> 		media 1000baseT mediaopt full-duplex
> 		media 100baseTX mediaopt full-duplex
> 		media 10baseT mediaopt full-duplex
> 		media autoselect
> 	input: 26783760 packets, 17739001536 bytes, 4049916 multicasts, 1214457 errors, 45046 queue drops
> 	output: 20557722 packets, 10152762352 bytes, 320814 multicasts
> 	inet xx.xx.xx.34/29 broadcast xx.xx.xx.39 flags 0
> 	inet xx.xx.xx.35/32 broadcast xx.xx.xx.35 flags 0
> 	inet xx.xx.xx.36/32 broadcast xx.xx.xx.36 flags 0
> 	inet xx.xx.xx.37/32 broadcast xx.xx.xx.37 flags 0
> 	inet6 fe80::ec4:7aff:fefa:3979%ixg3/64 flags 0 scopeid 0x4
> 
> # intrctl list
> interrupt id       CPU0      CPU1      CPU2     CPU3  device name(s)
> ioapic0 pin 9         0*        0         0        0  acpi SCI
> ioapic0 pin 4         0*        0         0        0  com0
> ioapic0 pin 3   2155553*        0         0        0  com1
> msi0 vec 0            0*        0         0        0  ismt0
> msix1 vec 0           0*        0         0        0  ahcisata0 port0
> msix1 vec 1           0*        0         0        0  ahcisata0 port1
> msix1 vec 2           0*        0         0        0  ahcisata0 port2
> msix1 vec 3           0*        0         0        0  ahcisata0 port3
> msix2 vec 5    56028281*        0         0        0  ahcisata1 port5
> msix2 vec 6           0*        0         0        0  ahcisata1 port6
> msi3 vec 0          276*        0         0        0  xhci0
> msix4 vec 0    36162181*        0         0        0  ixg0 TXRX0
> msix4 vec 1           0  10552694*        0        0  ixg0 TXRX1
> msix4 vec 2           0         0  13329746*       0  ixg0 TXRX2
> msix4 vec 3           0         0         0  9439699* ixg0 TXRX3
> msix4 vec 4           1*        0         0        0  ixg0 link
> msix5 vec 0     7496431*        0         0        0  ixg1 TXRX0
> msix5 vec 1           0   1226473*        0        0  ixg1 TXRX1
> msix5 vec 2           0         0    700512*       0  ixg1 TXRX2
> msix5 vec 3           0         0         0  1770846* ixg1 TXRX3
> msix5 vec 4           1*        0         0        0  ixg1 link
> msix6 vec 0     1015066*        0         0        0  ixg2 TXRX0
> msix6 vec 1           0    246663*        0        0  ixg2 TXRX1
> msix6 vec 2           0         0    209309*       0  ixg2 TXRX2
> msix6 vec 3           0         0         0   138600* ixg2 TXRX3
> msix6 vec 4           1*        0         0        0  ixg2 link
> msix7 vec 0    23012456*        0         0        0  ixg3 TXRX0
> msix7 vec 1           0   5872758*        0        0  ixg3 TXRX1
> msix7 vec 2           0         0   3033676*       0  ixg3 TXRX2
> msix7 vec 3           0         0         0  5857198* ixg3 TXRX3
> msix7 vec 4           3*        0         0        0  ixg3 link
> ioapic0 pin 23        0*        0         0        0  ichsmb0
> 
> # uptime
>  1:07PM  up 14 days,  1:17, 1 user, load averages: 0.11, 0.11, 0.12
> 
> # dmesg (shortened)
> Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
>     2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017,
>     2018, 2019, 2020 The NetBSD Foundation, Inc.  All rights reserved.
> Copyright (c) 1982, 1986, 1989, 1991, 1993
>     The Regents of the University of California.  All rights reserved.
> 
> NetBSD 9.99.72 (GATEWAY) #7: Thu Sep 10 13:58:53 UTC 2020
> 	kardel@gateway:/src/NetBSD/cur/src/obj.amd64/sys/arch/amd64/compile/GATEWAY
> ...
> ppb5 at pci0 dev 22 function 0: Intel C3000 PCIe Root Port for LAN 0 (rev. 0x11)
> ppb5: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 2.5GT/s
> pci6 at ppb5 bus 6
> pci6: i/o space, memory space enabled, rd/line, wr/inv ok
> ixg0 at pci6 dev 0 function 0: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 4.0.1-k
> ixg0: clearing prefetchable bit
> ixg0: device X550EM A
> ixg0: NVM Image Version 0.05 ID 0x8, NVM Map version 1.16, OEM NVM Image version 0.06, ETrackID 80000877
> ixg0: for TX/RX, interrupting at msix4 vec 0, bound queue 0 to cpu 0
> allocated pic msix4 type edge pin 1 level 6 to cpu1 slot 0 idt entry 96
> ixg0: for TX/RX, interrupting at msix4 vec 1, bound queue 1 to cpu 1
> allocated pic msix4 type edge pin 2 level 6 to cpu1 slot 1 idt entry 97
> ixg0: for TX/RX, interrupting at msix4 vec 2, bound queue 2 to cpu 2
> allocated pic msix4 type edge pin 3 level 6 to cpu1 slot 1 idt entry 97
> ixg0: for TX/RX, interrupting at msix4 vec 3, bound queue 3 to cpu 3
> allocated pic msix4 type edge pin 4 level 6 to cpu1 slot 1 idt entry 97
> ixg0: for link, interrupting at msix4 vec 4, affinity to cpu 0
> ixg0: Using MSI-X interrupts with 5 vectors
> ixg0: Ethernet address X----X
> ixg0: PHY OUI 0x000ac2, model 0x002a, rev. 0
> ixg0: feature cap 0xfa0<TEMP_SENSOR,LEGACY_TX,FDIR,MSI,MSIX,EEE>
> ixg0: feature ena 0xc20<TEMP_SENSOR,MSIX,EEE>
> ixg1 at pci6 dev 0 function 1: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 4.0.1-k
> ixg1: clearing prefetchable bit
> ixg1: device X550EM A
> ixg1: NVM Image Version 0.05 ID 0x8, NVM Map version 1.16, OEM NVM Image version 0.06, ETrackID 80000877
> allocated pic msix5 type edge pin 0 level 6 to cpu1 slot 1 idt entry 97
> ixg1: for TX/RX, interrupting at msix5 vec 0, bound queue 0 to cpu 0
> allocated pic msix5 type edge pin 1 level 6 to cpu1 slot 1 idt entry 97
> ixg1: for TX/RX, interrupting at msix5 vec 1, bound queue 1 to cpu 1
> allocated pic msix5 type edge pin 2 level 6 to cpu1 slot 2 idt entry 98
> ixg1: for TX/RX, interrupting at msix5 vec 2, bound queue 2 to cpu 2
> allocated pic msix5 type edge pin 3 level 6 to cpu1 slot 2 idt entry 98
> ixg1: for TX/RX, interrupting at msix5 vec 3, bound queue 3 to cpu 3
> allocated pic msix5 type edge pin 4 level 6 to cpu1 slot 2 idt entry 98
> ixg1: for link, interrupting at msix5 vec 4, affinity to cpu 0
> ixg1: Using MSI-X interrupts with 5 vectors
> ixg1: Ethernet address X----X
> ixg1: PHY OUI 0x000ac2, model 0x002a, rev. 0
> ixg1: feature cap 0xfa0<TEMP_SENSOR,LEGACY_TX,FDIR,MSI,MSIX,EEE>
> ixg1: feature ena 0xc20<TEMP_SENSOR,MSIX,EEE>
> ppb6 at pci0 dev 23 function 0: Intel C3000 PCIe Root Port for LAN 1 (rev. 0x11)
> ppb6: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 2.5GT/s
> pci7 at ppb6 bus 7
> pci7: i/o space, memory space enabled, rd/line, wr/inv ok
> ixg2 at pci7 dev 0 function 0: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 4.0.1-k
> ixg2: clearing prefetchable bit
> ixg2: device X550EM A
> ixg2: NVM Image Version 0.05 ID 0x8, NVM Map version 1.16, OEM NVM Image version 0.06, ETrackID 8000087c
> allocated pic msix6 type edge pin 0 level 6 to cpu1 slot 2 idt entry 98
> ixg2: for TX/RX, interrupting at msix6 vec 0, bound queue 0 to cpu 0
> allocated pic msix6 type edge pin 1 level 6 to cpu1 slot 2 idt entry 98
> ixg2: for TX/RX, interrupting at msix6 vec 1, bound queue 1 to cpu 1
> allocated pic msix6 type edge pin 2 level 6 to cpu1 slot 3 idt entry 99
> ixg2: for TX/RX, interrupting at msix6 vec 2, bound queue 2 to cpu 2
> allocated pic msix6 type edge pin 3 level 6 to cpu1 slot 3 idt entry 99
> ixg2: for TX/RX, interrupting at msix6 vec 3, bound queue 3 to cpu 3
> allocated pic msix6 type edge pin 4 level 6 to cpu1 slot 3 idt entry 99
> ixg2: for link, interrupting at msix6 vec 4, affinity to cpu 0
> ixg2: Using MSI-X interrupts with 5 vectors
> ixg2: Ethernet address X----X
> ixg2: PHY OUI 0x000ac2, model 0x002a, rev. 0
> ixg2: feature cap 0xfa0<TEMP_SENSOR,LEGACY_TX,FDIR,MSI,MSIX,EEE>
> ixg2: feature ena 0xc20<TEMP_SENSOR,MSIX,EEE>
> ixg3 at pci7 dev 0 function 1: Intel(R) PRO/10GbE PCI-Express Network Driver, Version - 4.0.1-k
> ixg3: clearing prefetchable bit
> ixg3: device X550EM A
> ixg3: NVM Image Version 0.05 ID 0x8, NVM Map version 1.16, OEM NVM Image version 0.06, ETrackID 8000087c
> allocated pic msix7 type edge pin 0 level 6 to cpu1 slot 3 idt entry 99
> ixg3: for TX/RX, interrupting at msix7 vec 0, bound queue 0 to cpu 0
> allocated pic msix7 type edge pin 1 level 6 to cpu1 slot 3 idt entry 99
> ixg3: for TX/RX, interrupting at msix7 vec 1, bound queue 1 to cpu 1
> allocated pic msix7 type edge pin 2 level 6 to cpu1 slot 4 idt entry 100
> ixg3: for TX/RX, interrupting at msix7 vec 2, bound queue 2 to cpu 2
> allocated pic msix7 type edge pin 3 level 6 to cpu1 slot 4 idt entry 100
> ixg3: for TX/RX, interrupting at msix7 vec 3, bound queue 3 to cpu 3
> allocated pic msix7 type edge pin 4 level 6 to cpu1 slot 4 idt entry 100
> ixg3: for link, interrupting at msix7 vec 4, affinity to cpu 0
> ixg3: Using MSI-X interrupts with 5 vectors
> ixg3: Ethernet address X----X
> ixg3: PHY OUI 0x000ac2, model 0x002a, rev. 0
> ixg3: feature cap 0xfa0<TEMP_SENSOR,LEGACY_TX,FDIR,MSI,MSIX,EEE>
> ixg3: feature ena 0xc20<TEMP_SENSOR,MSIX,EEE>
>> How-To-Repeat:
> 	Run current or older (I have been seeing this for ~1.5 years now) on a
> 	SuperMicro C3000 platform.
>> Fix:
> 	? - analyse error conditions?
> 
>> Unformatted:
>  	
>  	
> 

Could you show me the output of "vmstat -ev | grep ixg"?
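
If it is easier to see which counters move while the interface is deaf, two snapshots taken some time apart can be compared. The following is only a minimal sketch: the helper name ixg_counter_diff, its arguments, and the 60-second default interval are made up for illustration and are not part of any NetBSD tool. It assumes "vmstat -ev" prints one event counter per line.

```sh
#!/bin/sh
# Hypothetical helper (not from the PR): run a counter-dumping command
# twice, INTERVAL seconds apart, and print the ixg lines that changed.
#   $1  command to run   (assumed default: "vmstat -ev")
#   $2  seconds to wait  (assumed default: 60)
ixg_counter_diff() {
    cmd=${1:-"vmstat -ev"}
    interval=${2:-60}
    before=$(mktemp) || return 1
    after=$(mktemp) || { rm -f "$before"; return 1; }

    # First snapshot, keeping only the ixg lines.
    sh -c "$cmd" 2>/dev/null | grep ixg > "$before"
    sleep "$interval"
    # Second snapshot, taken the same way.
    sh -c "$cmd" 2>/dev/null | grep ixg > "$after"

    # diff exits with 1 when the snapshots differ; those lines are the
    # counters that moved during the interval.
    diff "$before" "$after"
    rc=$?
    rm -f "$before" "$after"
    return "$rc"
}
```

Calling ixg_counter_diff with no arguments while ixg3 is not receiving would list only the counters that changed in that minute; the plain "vmstat -ev | grep ixg" output is still what I would like to see first.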

-- 
-----------------------------------------------
                SAITOH Masanobu (msaitoh%execsw.org@localhost
                                 msaitoh%netbsd.org@localhost)

