tech-net archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: bnx(4) lockups?



On 2019/03/23 11:32, Masanobu SAITOH wrote:
On 2019/03/22 2:14, Stephen Borrill wrote:
On Wed, 20 Mar 2019, Masanobu SAITOH wrote:
Hi, Stephen.

On 2019/03/19 22:27, Stephen Borrill wrote:
I have a reliable installation on one Lenovo server which has wm(4) NICs. It has connections to the LAN and directly to the open Internet. I tried to migrate it to a different server which has bnx(4) NICs. If not connected to the Internet, it runs reliably. If connected to the Internet and traffic is flowing, it will lock solid after a while. I have encountered this previously and tracked it down to IPFilter. My workaround on that occasion was to use "pass in all/pass out all" rules (I could not disable IPFilter as I needed NAT) as anything more complex caused the lock. However, on this occasion I switched to NPF and the lockups continue. This is on netbsd-7.

The hardware has been running XenServer solidly under heavy load for a number of years. Doing internal data copies (to stress the HDDs and RAID controller) is also reliable.

I really can't think of much else besides the network hardware (and I don't have any spare NICs to hand). Is bnx(4) known bad?
[snip]

Could you try the following patch? This is taken from OpenBSD rev. 1.93.
Just guess. Not tested yet.

Unfortunately, it still locked up with this patch.

:-(

  I will try to reproduce the hard hang next week.

  Our bnx(4) driver is based on OpenBSD and have not pulled changes
from it for many years. And also we can see the difference from FreeBSD's
bce(4) by doing "s/bce/bnx/". I'm going to take some changes from other
BSD's and it might fix the problem.

  From my experience in bge(4), I suspect the bnx(4)'s hard hang might
come from access conflict between the driver(CPU) and the embedded
controller.

One question.

bnx0 at pci1 dev 0 function 0: Broadcom NetXtreme II BCM5709 1000Base-T
bnx0: Ethernet address 5c:f3:fc:e4:e6:78
bnx0: interrupting at ioapic1 pin 4
bnx0: ASIC BCM5709 C0 (0x57092003)
bnx0: PCIe x2 5Gbps
bnx0: Coal (RX:6,6,18,18; TX:20,20,80,80)
bnx1 at pci1 dev 0 function 1: Broadcom NetXtreme II BCM5709 1000Base-T
bnx1: Ethernet address 5c:f3:fc:e4:e6:7a
bnx1: interrupting at ioapic1 pin 16
bnx1: ASIC BCM5709 C0 (0x57092003)
bnx1: PCIe x2 5Gbps
bnx1: Coal (RX:6,6,18,18; TX:20,20,80,80)
bnx2 at pci2 dev 0 function 0: Broadcom NetXtreme II BCM5709 1000Base-T
bnx2: Ethernet address 5c:f3:fc:6b:c6:b4
bnx2: interrupting at ioapic1 pin 5
bnx2: ASIC BCM5709 C0 (0x57092003)
bnx2: PCIe x2 5Gbps
bnx2: Coal (RX:6,6,18,18; TX:20,20,80,80)
bnx3 at pci2 dev 0 function 1: Broadcom NetXtreme II BCM5709 1000Base-T
bnx3: Ethernet address 5c:f3:fc:6b:c6:b6
bnx3: interrupting at ioapic1 pin 17
bnx3: ASIC BCM5709 C0 (0x57092003)
bnx3: PCIe x2 5Gbps
bnx3: Coal (RX:6,6,18,18; TX:20,20,80,80)

No MII PHYs?

Could you show me:

	0) the dmesg output of the PHYs if available.

	1) ifconfig -m

	2) pcictl pci0 dump -b [12] -d 0 -f [01]

Thanks.

--
-----------------------------------------------
                SAITOH Masanobu (msaitoh%execsw.org@localhost
                                 msaitoh%netbsd.org@localhost)


Home | Main Index | Thread Index | Old Index