Subject: kern/18676: bge times out, and resets
To: None <gnats-bugs@gnats.netbsd.org>
From: None <scotte@warped.com>
List: netbsd-bugs
Date: 10/17/2002 01:58:44
>Number:         18676
>Category:       kern
>Synopsis:       "bge0" periodically times out, and gets reset.
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Wed Oct 16 18:59:00 PDT 2002
>Closed-Date:
>Last-Modified:
>Originator:     Scott Ellis
>Release:        NetBSD 1.6I
>Organization:
   //////////////////////////////////////////////////////////////////////
  //    Scott Ellis     //             scotte@warped.com              //
 //////////////////////////////////////////////////////////////////////
// WARNING: This signature warps  time and space in its vicinity    //
>Environment:
	
	
System: NetBSD intrepid 1.6I NetBSD 1.6I (INTREPID.MP) #0: Mon Oct 14 22:30:35 PDT 2002 scotte@intrepid:/misc/localsrc/netbsd/netbsd_cvs_src/src/sys/arch/i386/compile/INTREPID.MP i386
Architecture: i386
Machine: i386
>Description:
	
With many concurrent Samba accesses (reads and writes) on the machine, all
network connectivity appears to stop, and the system logs:

bge0: watchdog timeout -- resetting
bge0: block failed to stop: reg 0x3400, bit 0x00000002
bge0: block failed to stop: reg 0x2400, bit 0x00000002
bge0: block failed to stop: reg 0x1800, bit 0x00000002
bge0: block failed to stop: reg 0x4800, bit 0x00000002
bge0: block failed to stop: reg 0x3c00, bit 0x00000002
bge0: block failed to stop: reg 0x4c00, bit 0x00000002

It's unclear what is causing this, however the same card appeared to work fine
in a Windows2k server, so I don't believe this is a hardware issue.

>How-To-Repeat:
	
Seems that having more than 8 concurrent, active smbd connections reading or
writing causes this to happen, although it's unclear exactly why that
would trigger this.  There are higher network loads (throughput wise) during
other activities, and this doesn't seem to occur.
>Fix:
	
None that I'm aware of.

Interface configuration is as follows:

bge0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1504
        capabilities=7<IP4CSUM,TCP4CSUM,UDP4CSUM>
        enabled=7<IP4CSUM,TCP4CSUM,UDP4CSUM>
        address: 00:04:76:3b:1a:df
        media: Ethernet autoselect (100baseTX full-duplex)
        status: active
lo0: flags=8009<UP,LOOPBACK,MULTICAST> mtu 33220
        inet 127.0.0.1 netmask 0xff000000
vlan0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
        capabilities=7<IP4CSUM,TCP4CSUM,UDP4CSUM>
        enabled=7<IP4CSUM,TCP4CSUM,UDP4CSUM>
        vlan: 2 parent: bge0
        address: 00:04:76:3b:1a:df
        inet 66.27.90.40 netmask 0xfffff000 broadcast 255.255.255.255
vlan1: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
        capabilities=7<IP4CSUM,TCP4CSUM,UDP4CSUM>
        enabled=7<IP4CSUM,TCP4CSUM,UDP4CSUM>
        vlan: 1 parent: bge0
        address: 00:04:76:3b:1a:df
        inet 192.168.1.1 netmask 0xfffffff0 broadcast 192.168.1.15

Dmesg is as follows:

NetBSD 1.6I (INTREPID.MP) #0: Mon Oct 14 22:30:35 PDT 2002
    scotte@intrepid:/misc/localsrc/netbsd/netbsd_cvs_src/src/sys/arch/i386/compile/INTREPID.MP
total memory = 767 MB
avail memory = 710 MB
using 6144 buffers containing 39400 KB of memory
BIOS32 rev. 0 found at 0xfb2a0
PCI BIOS rev. 2.1 found at 0xfb2d0
PCI IRQ Routing Table rev. 1.0 found at 0xfdba0, size 192 bytes (10 entries)
PCI Interrupt Router at 000:07:0 (vendor 0x1106 product 0x0596)
PCI Exclusive IRQs: 5 10 11 12
mainbus0 (root)
mainbus0: Intel MP Specification (Version 1.4) (OEM00000 PROD00000000)
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: Intel Pentium III (Katmai) (686-class), 602.25 MHz
cpu0: features 387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu0: features 387fbff<PGE,MCA,CMOV,FGPAT,PSE36,PN,MMX>
cpu0: features 387fbff<FXSR,SSE>
cpu0: I-cache 16 KB 32b/line 4-way, D-cache 16 KB 32b/line 4-way
cpu0: L2 cache 512 KB 32b/line 4-way
cpu0: ITLB 32 4 KB entries 4-way, 2 4 MB entries fully associative
cpu0: DTLB 64 4 KB entries 4-way, 8 4 MB entries 4-way
cpu0: serial number 0000-0673-0001-7410-01B6-49C8
cpu0: calibrating local timer
cpu0: apic clock running at 133 MHz
cpu0: 32 page colors
cpu1 at mainbus0: apid 1 (application processor)
cpu1: starting
cpu1: Intel Pentium III (Katmai) (686-class), 602.20 MHz
cpu1: features 387fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu1: features 387fbff<PGE,MCA,CMOV,FGPAT,PSE36,PN,MMX>
cpu1: features 387fbff<FXSR,SSE>
cpu1: I-cache 16 KB 32b/line 4-way, D-cache 16 KB 32b/line 4-way
cpu1: L2 cache 512 KB 32b/line 4-way
cpu1: ITLB 32 4 KB entries 4-way, 2 4 MB entries fully associative
cpu1: DTLB 64 4 KB entries 4-way, 8 4 MB entries 4-way
cpu1: serial number 0000-0673-0003-899B-4AD3-E890
mpbios: bus 0 is type PCI   
mpbios: bus 1 is type PCI   
mpbios: bus 2 is type ISA   
ioapic0 at mainbus0 apid 2 (I/O APIC)
ioapic0: pa 0xfec00000, version 11, 24 pins
pnpbios0 at mainbus0: nodes 12, max len 78
pckbc0 at pnpbios0 index 4 (PNP0303): kbd port
com0 at pnpbios0 index 12 (PNP0501)
com0: io 3f8-3ff, irq 4
com0: ns16550a, working fifo
com0: console
fdc0 at pnpbios0 index 13 (PNP0700)
fdc0: io 3f2-3f5, irq 6, dma 2
fdc0: ctl io 3f7 didn't probe. Forced attach
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0
pchb0: vendor 0x1106 product 0x0691 (rev. 0xc4)
agp0 at pchb0: aperture at 0xd0000000, size 0xe400000
ppb0 at pci0 dev 1 function 0: vendor 0x1106 product 0x8598 (rev. 0x00)
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled
pcib0 at pci0 dev 7 function 0
pcib0: vendor 0x1106 product 0x0596 (rev. 0x23)
pciide0 at pci0 dev 7 function 1: VIA Technologies VT82C596A (Apollo Pro) ATA66 controller
pciide0: bus-master DMA support present
pciide0: primary channel configured to compatibility mode
wd0 at pciide0 channel 0 drive 0: <WDC WD1200JB-75CRA0>
wd0: drive supports 16-sector PIO transfers, LBA addressing
wd0: 111 GB, 16383 cyl, 16 head, 63 sec, 512 bytes/sect x 234375000 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
pciide0: primary channel interrupting at irq 14
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 4 (Ultra/66) (using DMA data transfers)
pciide0: secondary channel configured to compatibility mode
atapibus0 at pciide0 channel 1: 2 targets
cd0 at atapibus0 drive 0: <685A, , 8.1B> cdrom removable
cd0: 32-bit data port
pciide0: secondary channel interrupting at irq 15
cd0(pciide0:1:0): using PIO mode 3
uhci0 at pci0 dev 7 function 2: vendor 0x1106 product 0x3038 (rev. 0x11)
uhci0: interrupting at apic 2 int 19 (irq 10)
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: vendor 0x1106 UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
pchb1 at pci0 dev 7 function 3
pchb1: vendor 0x1106 product 0x3050 (rev. 0x30)
bge0 at pci0 dev 9 function 0: Broadcom BCM5700 Gigabit Ethernet
bge0: interrupting at apic 2 int 16 (irq 11)
bge0: ASIC BCM5700 B1, Ethernet address 00:04:76:3b:1a:df
brgphy0 at bge0 phy 1: BCM5401 1000BASE-T media interface, rev. 3
brgphy0: using BCM5401 DSP patch
brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ohci0 at pci0 dev 14 function 0: vendor 0x1033 product 0x0035 (rev. 0x41)
ohci0: interrupting at apic 2 int 17 (irq 5)
ohci0: OHCI version 1.0
usb1 at ohci0: USB revision 1.0
uhub1 at usb1
uhub1: vendor 0x1033 OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 3 ports with 3 removable, self powered
ohci1 at pci0 dev 14 function 1: vendor 0x1033 product 0x0035 (rev. 0x41)
ohci1: interrupting at apic 2 int 18 (irq 12)
ohci1: OHCI version 1.0
usb2 at ohci1: USB revision 1.0
uhub2 at usb2
uhub2: vendor 0x1033 OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
ehci0 at pci0 dev 14 function 2: vendor 0x1033 product 0x00e0 (rev. 0x02)
ehci0: interrupting at apic 2 int 19 (irq 10)
ehci0: EHCI version 0.95
ehci0: companion controllers, 3 ports each: ohci0 ohci1
usb3 at ehci0: USB revision 2.0
uhub3 at usb3
uhub3: vendor 0x1033 EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub3: 5 ports with 5 removable, self powered
isa0 at pcib0
lm0 at isa0 port 0x290-0x297: W83782D
pcppi0 at isa0 port 0x61
midi0 at pcppi0: PC speaker
sysbeep0 at pcppi0
npx0 at isa0 port 0xf0-0xff: using exception 16
biomask 0 netmask 0 ttymask 0
ioapic0: enabling
boot device: wd0
root on wd0a dumps on wd0b
root file system type: ffs
cpu1: CPU 1 running
udsbr0 at uhub1 port 2
udsbr0: GemTek Corp USB FM Radio, rev 1.00/4.10, addr 2
radio0 at udsbr0
uhub1: device problem, disabling port 3
IP Filter: v3.4.29 initialized.  Default = pass all, Logging = enabled
>Release-Note:
>Audit-Trail:
>Unformatted: