Subject: Re: wm0 receive overrun after adding lsi fibre channel card.
To: Thor Lancelot Simon <tls@rek.tjls.com>
From: Jonathan Kay <jpk@panix.com>
List: port-i386
Date: 02/24/2005 12:14:35
On Thu, 24 Feb, 2005 at 11:50:49AM -0500, it was scribed by tls@rek.tjls.com that:
> On Thu, Feb 24, 2005 at 08:35:58AM -0800, Ed Gould wrote:
> > > but now about twice or four times a day I get
> > > 
> > > wm0: Received overrun
> > > 
> > > and the machine stop responding to pings for a ~10 seconds--it can still
> > > ping itself, but can't ping out.  it's light on the switch stays on.
> > > 
> > >   These might be related to lots of traffic coming through the ethernet,
> > > but I'm not 100% sure about this..  (sometimes it seems to happen when
> > > there is very little traffic goingon)
> > 
> > It looks to me like you may be overrunning the capacity of your PCI-X 
> > bus.  Everything is attached to it, and you've just added a high-speed 
> 
> Look at the dmesg line for the gigabit ethernet card: it claims it's
> running at 32-bit, 33MHz.  Of course, if that's true, it will drag the
> entire bus down to 33Mhz, discarding 3/4 of the bandwidth for all
> devices.
> 
> I wasn't aware there were any 32/33 'wm' cards.  If there are, and Jonathan
> has one, he should yank it from his machine ASAP and replace it with
> something that has less suck.  But the real culprit may well be some other
> 33MHz card; Jonathan didn't post the entire dmesg so there's no way to
> tell.
> 
> I think the Apple 'mpt' card is 64/66, so it will drag the bus down to
> 66MHz, but that's not so bad as running it all at 33.  Is there some
> other slow card on the bus?  If so, and the motherboard has multiple
> buses (most server boards do), rearranging things so the 33MHz devices
> are on one bus and the fast devices on another will be a big help.
 
the wm card is onboard, so can't really yank it, but if I were creative
with a soldering iron.. :-)

looking up whats actually in the 600sc, it has 1 pci bus--64/bit 33mhz.
oops.  though fwiw, the dell PE 1600sc (which has 3 pci buses: 1 PCI-X,
1 @ 64/66mhz and 1 and 32/33mhz) has an onboard wm0 which claims to be
runninat @ 32/33mhz too.

the 3ware RAID isn't doing anything any more--the system / and /usr live
on it--all the IO goes to the Xserve RAID now..

aside from the wm, twe, and mpt there are just some USB and IDE
controllers. the USB & IDE don't really get used--I could probably 
disable them all.
I have never run into any problems before w/ just the twe & wm, so I
never needed to fix it..  

I've included the whole dmesg here--just trying to save you some reading:

NetBSD 2.99.10 (CLUB-UNIX.MP) #4: Sat Feb 19 14:03:55 EST 2005
        root@club-unix.clubhouse.local:/usr/src/20041124/src/sys/arch/i386/compile/
CLUB-UNIX.MP
total memory = 1535 MB
avail memory = 1497 MB
BIOS32 rev. 0 found at 0xffe90
mainbus0 (root)
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: Intel Pentium 4 (686-class), 3066.02 MHz, id 0xf27
cpu0: features bfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu0: features bfebfbff<PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX>
cpu0: features bfebfbff<FXSR,SSE,SSE2,SS,HTT,TM,SBF>
cpu0: features2 400<CID>
cpu0: "Intel(R) Pentium(R) 4 CPU 3.06GHz"
cpu0: I-cache 12K uOp cache 8-way, D-cache 8 KB 64B/line 4-way
cpu0: L2 cache 512 KB 64B/line 8-way
cpu0: ITLB 4K/4M: 64 entries
cpu0: DTLB 4K/4M: 64 entries
cpu0: running without thermal monitor!
cpu0: calibrating local timer
cpu0: apic clock running at 133 MHz
cpu0: 16 page colors
cpu1 at mainbus0: apid 1 (application processor)
cpu1: starting
cpu1: Intel Pentium 4 (686-class), 3065.81 MHz, id 0xf27
cpu1: features bfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu1: features bfebfbff<PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX>
cpu1: features bfebfbff<FXSR,SSE,SSE2,SS,HTT,TM,SBF>
cpu1: features2 400<CID>
cpu1: "Intel(R) Pentium(R) 4 CPU 3.06GHz"
cpu1: I-cache 12K uOp cache 8-way, D-cache 8 KB 64B/line 4-way
cpu1: L2 cache 512 KB 64B/line 8-way
cpu1: ITLB 4K/4M: 64 entries
cpu1: DTLB 4K/4M: 64 entries
cpu1: running without thermal monitor!
ioapic0 at mainbus0 apid 2 (I/O APIC)
ioapic0: pa 0xfec00000, version 11, 16 pins
ioapic0: misconfigured as apic 0
ioapic0: remapped to apic 2
ioapic1 at mainbus0 apid 3 (I/O APIC)
ioapic1: pa 0xfec01000, version 11, 16 pins
ioapic1: misconfigured as apic 0
ioapic1: remapped to apic 3
ioapic2 at mainbus0 apid 4 (I/O APIC)
ioapic2: pa 0xfec02000, version 11, 16 pins
ioapic2: misconfigured as apic 0
ioapic2: remapped to apic 4
acpi0 at mainbus0
acpi0: using Intel ACPI CA subsystem version 20040211
acpi0: X/RSDT: OemId <DELL  ,PE600SC ,00000001>, AslId <MSFT,0100000a>
acpi0: SCI interrupting at int 9
acpi0: fixed-feature power button present
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
acpi: activated PNP0C0F
PNP0A03 at acpi0 not configured
PNP0200 at acpi0 not configured
npx1 at acpi0 (PNP0C04)
npx1: io 0xf0-0xff irq 13
npx1: using exception 16
PNP0000 at acpi0 not configured
PNP0800 at acpi0 not configured
PNP0100 at acpi0 not configured
fdc1 at acpi0 (PNP0700)
fdc1: io 0x3f0-0x3f5,0x3f7 irq 6 drq 2
pckbc1 at acpi0 (PNP0303): kbd port
pckbc1: io 0x60,0x64 irq 1
pckbc2 at acpi0 (PNP0F13): aux port
pckbc2: irq 12
com0 at acpi0 (PNP0501-1)
com0: io 0x3f8-0x3ff irq 4
com0: ns16550a, working fifo
lpt0 at acpi0 (PNP0401)
lpt0: io 0x378-0x37f,0x778-0x77f irq 7 drq 1
PNP0B00 at acpi0 not configured
PNP0C01 at acpi0 not configured
PNP0C01 at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
PNP0C0F at acpi0 not configured
pckbd0 at pckbc1 (kbd slot)
pckbc1: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard
pms0 at pckbc1 (aux slot)
pckbc1: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0
pchb0: ServerWorks CMIC-SL PCI/AGP bridge (rev. 0x32)
pchb1 at pci0 dev 0 function 1
pchb1: ServerWorks CMIC-SL PCI/AGP bridge (rev. 0x00)
wm0 at pci0 dev 2 function 0: Intel i82540EM 1000BASE-T Ethernet, rev. 2
wm0: interrupting at ioapic1 pin 1 (irq 10)
wm0: 32-bit 33MHz PCI bus
wm0: 64 word (6 address bits) MicroWire EEPROM
wm0: Ethernet address 00:c0:9f:22:a9:51
makphy0 at wm0 phy 1: Marvell 88E1011 Gigabit PHY, rev. 3
makphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, 
auto
ohci0 at pci0 dev 3 function 0: NEC USB Host Controller (rev. 0x43)
ohci0: interrupting at ioapic1 pin 2 (irq 5)
ohci0: OHCI version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
uhub0 at usb0
uhub0: NEC OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
ohci1 at pci0 dev 3 function 1: NEC USB Host Controller (rev. 0x43)
ohci1: interrupting at ioapic1 pin 3 (irq 10)
ohci1: OHCI version 1.0, legacy support
usb1 at ohci1: USB revision 1.0
uhub1 at usb1
uhub1: NEC OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 1 port with 1 removable, self powered
ehci0 at pci0 dev 3 function 2: NEC USB Host Controller (rev. 0x04)
ehci0: interrupting at ioapic1 pin 2 (irq 5)
ehci0: EHCI version 1.0
ehci0: companion controllers, 2 ports each: ohci0 ohci1
usb2 at ehci0: USB revision 2.0
uhub2 at usb2
uhub2: NEC EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub2: single transaction translator
uhub2: 3 ports with 3 removable, self powered
mpt0 at pci0 dev 5 function 0: LSI Logic FC929X FC Adapter
mpt0: interrupting at ioapic1 pin 7 (irq 5)
mpt0: Port 0: Link state Failed
mpt0: External Bus Reset
mpt0: Port 0: FC Link Event: LIP(f8,f7) (Loop Initialization)
mpt0:   Device detected loop failure before acquiring AL_PA
mpt0: Port 0: Link state Active
mpt0: Rescan Port 0
scsibus0 at mpt0: 256 targets, 8 luns per target
mpt1 at pci0 dev 5 function 1: LSI Logic FC929X FC Adapter
mpt1: interrupting at ioapic1 pin 8 (irq 10)
scsibus1 at mpt1: 256 targets, 8 luns per target
twe0 at pci0 dev 7 function 0: 3ware Escalade
twe0: interrupting at ioapic1 pin 11 (irq 5)
twe0: 4 ports, Firmware FE7X 1.05.00.036, BIOS BE7X 1.08.00.044
twe0: Monitor ME7X 1.01.00.035, PCB Rev3    , Achip V3.20   , Pchip V1.30   
twe0: port 0: WDC WD1200JB-00CRA0                      114473 MB
twe0: port 1: WDC WD1200JB-00CRA0                      114473 MB
twe0: port 2: WDC WD1200JB-00CRA0                      114473 MB
twe0: port 3: WDC WD1200JB-00CRA0                      114473 MB
ld0 at twe0 unit 0: 64K stripe RAID5, status: Normal
ld0: 335 GB, 43779 cyl, 255 head, 63 sec, 512 bytes/sect x 703318656 sectors
vga1 at pci0 dev 8 function 0: ATI Technologies Rage XL (rev. 0x27)
wsdisplay0 at vga1 kbdmux 1: console (80x25, vt100 emulation), using wskbd0
wsmux1: connecting to wsdisplay0
rccide0 at pci0 dev 14 function 0
rccide0: ServerWorks CSB6 RAID/IDE Controller (rev. 0xa0)
rccide0: bus-master DMA support present
rccide0: primary channel configured to native-PCI mode
rccide0: using ioapic0 pin 11 (irq 11) for native-PCI interrupt
atabus0 at rccide0 channel 0
rccide0: secondary channel wired to native-PCI mode
atabus1 at rccide0 channel 1
pchb2 at pci0 dev 15 function 0
pchb2: ServerWorks CSB6 southbridge (rev. 0xa0)
rccide1 at pci0 dev 15 function 1
rccide1: ServerWorks CSB6 RAID/IDE Controller (rev. 0xa0)
rccide1: bus-master DMA support present
rccide1: primary channel configured to compatibility mode
rccide1: primary channel interrupting at ioapic0 pin 14 (irq 14)
atabus2 at rccide1 channel 0
rccide1: secondary channel configured to compatibility mode
rccide1: secondary channel interrupting at ioapic0 pin 15 (irq 15)
atabus3 at rccide1 channel 1
ohci2 at pci0 dev 15 function 2: ServerWorks CSB6 USB Host Controller (rev. 0x05)
ohci2: interrupting at ioapic0 pin 3 (irq 3)
ohci2: OHCI version 1.0, legacy support
usb3 at ohci2: USB revision 1.0
uhub3 at usb3
uhub3: ServerWorks OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
pcib0 at pci0 dev 15 function 3
pcib0: ServerWorks CSB6 ISA/LPC bridge (rev. 0x00)
isa0 at pcib0
pcppi0 at isa0 port 0x61
sysbeep0 at pcppi0
isapnp0 at isa0 port 0x279: ISA Plug 'n Play device support
isapnp0: no ISA Plug 'n Play devices found
ioapic0: enabling
ioapic1: enabling
ioapic2: enabling
fd0 at fdc1 drive 0: density unknown
Kernelized RAIDframe activated
scsibus0: waiting 2 seconds for devices to settle...
scsibus1: waiting 2 seconds for devices to settle...
atapibus0 at atabus0: 2 targets
cd0 at atapibus0 drive 0: <SAMSUNG CD-ROM  SC-148C, , B105> cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd0(rccide0:0:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33) (using 
DMA)
sd0 at scsibus0 target 0 lun 0: <APPLE, Xserve RAID, 1.24> disk fixed
sd0: 1117 GB, 143080 cyl, 128 head, 128 sec, 512 bytes/sect x 2344222720 sectors
sd1 at scsibus0 target 0 lun 1: <APPLE, Xserve RAID, 1.24> disk fixed
sd1: 1117 GB, 143080 cyl, 128 head, 128 sec, 512 bytes/sect x 2344222720 sectors
boot device: ld0
root on ld0a dumps on ld0b
root file system type: ffs
cpu1: CPU 1 running