Current-Users archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: systems with -current wm(4) hang?



Using "if_wm.c" r1.407,

A ThinkPad T42 with:

wm0 at pci2 dev 1 function 0: Intel i82540EP 1000BASE-T Ethernet (rev. 0x03)
wm0: interrupting at irq 11
wm0: 32-bit 33MHz PCI bus
wm0: 64 words (6 address bits) MicroWire EEPROM
wm0: Ethernet address xx:xx:xx:xx:xx:xx 
makphy0 at wm0 phy 1: Marvell 88E1011 Gigabit PHY, rev. 4
makphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

cpu0: highest basic info 00000002
cpu0: highest extended info 80000004
cpu0: "Intel(R) Pentium(R) M processor 1.70GHz"
cpu0: Intel Pentium M (Dothan) (686-class), 1698.64 MHz
cpu0: family 0x6 model 0xd stepping 0x6 (id 0x6d6)
cpu0: features 0xafe9f9bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV>
cpu0: features 0xafe9f9bf<PAT,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,TM,SBF>
cpu0: features1 0x180<EST,TM2>
cpu0: I-cache 32KB 64B/line 8-way, D-cache 32KB 64B/line 8-way
cpu0: L2 cache 2MB 64B/line 8-way
cpu0: 64B prefetching
cpu0: ITLB 128 4KB entries 4-way, 2 4MB entries fully associative
cpu0: DTLB 128 4KB entries 4-way, 8 4MB entries 4-way
cpu0: Initial APIC ID 0
cpu0: Cluster/Package ID 0
cpu0: microcode version 0x18, platform ID 5

became unresponsive shortly after netbooting -current.  Following
reboot, it hung again while performing 'etcupdate' processing.


An IBM eServer x306, i386 with PCI/PCI-X busses has:

wm0 at pci1 dev 1 function 0: Intel i82547GI 1000BASE-T Ethernet (rev. 0x00)
wm0: interrupting at ioapic0 pin 18
wm0: Communication Streaming Architecture
wm0: 256 words (16 address bits) SPI EEPROM
wm0: Ethernet address xx:xx:xx:xx:xx:xx
igphy0 at wm0 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

wm1 at pci3 dev 3 function 0: Intel i82541GI 1000BASE-T Ethernet (rev. 0x00)
wm1: interrupting at ioapic0 pin 22
wm1: 32-bit 33MHz PCI bus
wm1: 256 words (8 address bits) MicroWire EEPROM
wm1: Ethernet address xx:xx:xx:xx:xx:xx
igphy1 at wm1 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

cpu0: highest basic info 00000005
cpu0: highest extended info 80000008
cpu0: "Intel(R) Pentium(R) 4 CPU 3.00GHz"
cpu0: Intel (686-class), 3000.62 MHz
cpu0: family 0xf model 0x4 stepping 0x1 (id 0xf41)
cpu0: features 0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE>
cpu0: features 0xbfebfbff<MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2>
cpu0: features 0xbfebfbff<SS,HTT,TM,SBF>
cpu0: features1 0x441d<SSE3,DTES64,MONITOR,DS-CPL,CID,xTPR>
cpu0: I-cache 12K uOp cache 8-way, D-cache 16KB 64B/line 8-way
cpu0: L2 cache 1MB 64B/line 8-way
cpu0: ITLB 4K/4M: 64 entries
cpu0: DTLB 4K/4M: 64 entries
cpu0: Initial APIC ID 0
cpu0: Cluster/Package ID 0
cpu0: SMT ID 0
cpu0: microcode version 0x17, platform ID 2

and booting from "wm0" hangs during multiuser boot while building the
"dev" database.

Booting from "wm1" completes multiuser boot and I was able to complete
'etcupdate' and 'postinstall' operations.  I built several packages
(${WRKOBJDIR} on local disk), but it eventually hung too.


An amd64-class machine (Dell Optiplex 760) with:

wm0 at pci0 dev 25 function 0: 82567LM-3 LAN Controller (rev. 0x02)
wm0: interrupting at msi0 vec 0
wm0: PCI-Express bus
wm0: 2048 words FLASH
wm0: Ethernet address xx:xx:xx:xx:xx:xx
makphy0 at wm0 phy 2: Marvell 88E1149 Gigabit PHY, rev. 1
makphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FD
X, auto

cpu0: highest basic info 0000000a
cpu0: highest extended info 80000008
cpu0: "Intel(R) Core(TM)2 Quad  CPU   Q8200  @ 2.33GHz"
cpu0: Intel Xeon 31xx, 33xx, 52xx, 54xx, Core 2 Quad 8xxx and 9xxx (686-class), 2327.64 MHz
cpu0: family 0x6 model 0x17 stepping 0x7 (id 0x10677)
cpu0: features 0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE>
cpu0: features 0xbfebfbff<MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2>
cpu0: features 0xbfebfbff<SS,HTT,TM,SBF>
cpu0: features1 0x8e39d<SSE3,DTES64,MONITOR,DS-CPL,EST,TM2,SSSE3,CX16,xTPR>
cpu0: features1 0x8e39d<PDCM,SSE41>
cpu0: features2 0x20100800<SYSCALL/SYSRET,XD,EM64T>
cpu0: features3 0x1<LAHF>
cpu0: I-cache 32KB 64B/line 8-way, D-cache 32KB 64B/line 8-way
cpu0: L2 cache 2MB 64B/line 8-way
cpu0: 64B prefetching
cpu0: ITLB 128 4KB entries 4-way, 8 2M/4 4M entries
cpu0: DTLB 256 4KB entries 4-way, 16 4MB entries 4-way
cpu0: Initial APIC ID 0
cpu0: Cluster/Package ID 0
cpu0: Core ID 0
cpu0: DSPM-eax 0x1<DTS>
cpu0: DSPM-ecx 0x1<HWF>
cpu0: SEF highest subleaf 00000000
cpu0: microcode version 0x705, platform ID 4

Booting an installation built from sources around 201605182230Z, all
seemed well.  Following an update to a system built from sources around
201605201620Z, the machine hung in "/etc/rc.d/fccache" during its first
boot.  Although the terminal driver responded, one could not drop into
DDB via the USB keyboard (system is USB-only until I can get the PS/2
serial adapter cable).  ACPI powerdown via the power switch hung as well,
so a forced power-off by holding the power button was required.

On the next boot, it completed startup but hung again during 'xdm'
initialization requiring another forced power-cycle.  Since disabling
'xdm', the machine has now booted multiuser.  Further stress testing
consisted of recursive 'pkg_delete' of old (GCC 4.8.5-built) packages
in preparation for rebuilding with GCC 5.3.0.  It eventually hung.

Using the same machine to investigate:

wm0 at pci1 dev 0 function 0: Intel i82574L (rev. 0x00)
wm0: for TX and RX interrupting at msix0 vec 0 affinity to 1
wm0: for TX and RX interrupting at msix0 vec 1 affinity to 2
wm0: for LINK interrupting at msix0 vec 2
wm0: PCI-Express bus
wm0: Please update the Bootagent
wm0: 2048 words FLASH, version 1.8.0, Image Unique ID 0000ffff
wm0: Ethernet address xx:xx:xx:xx:xx:xx
makphy0 at wm0 phy 1: Marvell 88E1149 Gigabit PHY, rev. 1
makphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

booted without problems.  Stress testing building packages did not
provoke a hang during the time I ran it (around 12 hours).

Again using the same machine to investigate:

wm1 at pci4 dev 0 function 0: Intel i82541PI 1000BASE-T Ethernet (rev. 0x05)
wm1: interrupting at ioapic0 pin 16
wm1: 32-bit 33MHz PCI bus
wm1: 64 words (6 address bits) MicroWire EEPROM
wm1: Ethernet address xx:xx:xx:xx:xx:xx
igphy0 at wm1 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

booted without problems.  Stress testing building packages hung rather
soon after.


A Dell PowerEdge 750 (i386) with PCI/PCI-X busses and:

wm0 at pci1 dev 1 function 0: Intel i82547GI 1000BASE-T Ethernet (rev. 0x00)
wm0: interrupting at ioapic0 pin 18
wm0: Communication Streaming Architecture
wm0: 64 words (6 address bits) MicroWire EEPROM
wm0: Ethernet address xx:xx:xx:xx:xx:xx
igphy0 at wm0 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

wm1 at pci3 dev 2 function 0: Intel i82541GI 1000BASE-T Ethernet (rev. 0x00)
wm1: interrupting at ioapic0 pin 21
wm1: 32-bit 33MHz PCI bus
wm1: 64 words (6 address bits) MicroWire EEPROM
wm1: Ethernet address xx:xx:xx:xx:xx:xx
igphy1 at wm1 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

cpu0: highest basic info 00000002
cpu0: highest extended info 80000004
cpu0: "Intel(R) Celeron(R) CPU 2.40GHz"
cpu0: Intel Celeron (686-class), 2400.19 MHz
cpu0: family 0xf model 0x2 stepping 0x9 (id 0xf29)
cpu0: features 0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE>
cpu0: features 0xbfebfbff<MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2>
cpu0: features 0xbfebfbff<SS,HTT,TM,SBF>
cpu0: features1 0x4400<CID,xTPR>
cpu0: I-cache 12K uOp cache 8-way, D-cache 8KB 64B/line 4-way
cpu0: L2 cache 128KB 64B/line 2-way
cpu0: ITLB 4K/4M: 128 entries
cpu0: DTLB 4K/4M: 64 entries
cpu0: Initial APIC ID 0
cpu0: Cluster/Package ID 0
cpu0: microcode version 0x2e, platform ID 2

booting from "wm0" hung during rebuilding "dev" database on first boot.
As the machine has PS/2 keyboard attached, "Ctrl-Alt-ESC" allowed dropping
to the debugger to reboot.  Subsequent boots with this interface hung in
the same place.

Booting from "wm1" completed startup and 'etcupdate'/'postinstall'
operations and completed its subsequent boot.  Further stress testing
building packages ran for a long time, but halted when the machine
spontaneously shut down and would not remain powered up.


A Dell PowerEdge 2850 (amd64-class, PCI/PCI-X busses) with:

wm0 at pci6 dev 7 function 0: Intel i82541GI 1000BASE-T Ethernet (rev. 0x05)
wm0: interrupting at ioapic2 pin 0
wm0: 32-bit 66MHz PCI bus
wm0: 512 words (16 address bits) SPI EEPROM
wm0: Ethernet address xx:xx:xx:xx:xx:xx
igphy0 at wm0 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

wm1 at pci7 dev 8 function 0: Intel i82541GI 1000BASE-T Ethernet (rev. 0x05)
wm1: interrupting at ioapic2 pin 1
wm1: 32-bit 66MHz PCI bus
wm1: 256 words (16 address bits) SPI EEPROM
wm1: Ethernet address xx:xx:xx:xx:xx:xx
igphy1 at wm1 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

cpu0: highest basic info 00000005
cpu0: highest extended info 80000008
cpu0: "Intel(R) Xeon(TM) CPU 3.40GHz"
cpu0: Intel (686-class), 3391.60 MHz
cpu0: family 0xf model 0x4 stepping 0x3 (id 0xf43)
cpu0: features 0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE>
cpu0: features 0xbfebfbff<MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2>
cpu0: features 0xbfebfbff<SS,HTT,TM,SBF>
cpu0: features1 0x649d<SSE3,DTES64,MONITOR,DS-CPL,EST,CID,CX16,xTPR>
cpu0: features2 0x20100800<SYSCALL/SYSRET,XD,EM64T>
cpu0: I-cache 12K uOp cache 8-way, D-cache 16KB 64B/line 8-way
cpu0: L2 cache 2MB 64B/line 8-way
cpu0: ITLB 4K/4M: 64 entries
cpu0: DTLB 4K/4M: 64 entries
cpu0: Initial APIC ID 0
cpu0: Cluster/Package ID 0
cpu0: SMT ID 0
cpu0: microcode version 0x4, platform ID 0

booting from "wm0" hung rebuilding the "dev" database during startup.

Booting from "wm1" passed the database rebuild but hung while updating
fontconfig cache.  Subsequent reboot with "wm1" succeeded.  Further
stress testing building packages eventually hung.


A Dell PowerEdge SC430 (amd64-class, PCI/PCIe busses) with:

wm0 at pci2 dev 0 function 0: Intel PRO/1000 PT (82571EB) (rev. 0x06)
wm0: interrupting at ioapic0 pin 16
wm0: PCI-Express bus
wm0: Please update the Bootagent
wm0: 2048 words (16 address bits) SPI EEPROM, version 5.6.2, Image Unique ID ffffffff
wm0: Ethernet address xx:xx:xx:xx:xx:xx
igphy0 at wm0 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
wm1 at pci2 dev 0 function 1: Intel PRO/1000 PT (82571EB) (rev. 0x06)
wm1: interrupting at ioapic0 pin 17
wm1: PCI-Express bus
wm1: 2048 words (16 address bits) SPI EEPROM, version 5.6.2, Image Unique ID ffffffff
wm1: Ethernet address xx:xx:xx:xx:xx:xx
igphy1 at wm1 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto

(an Intel Pro/1000 PT Dual Port Server Adapter) and

cpu0: highest basic info 00000005
cpu0: highest extended info 80000008
cpu0: "Intel(R) Pentium(R) 4 CPU 2.80GHz"
cpu0: Intel (686-class), 2793.23 MHz
cpu0: family 0xf model 0x4 stepping 0x9 (id 0xf49)
cpu0: features 0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE>
cpu0: features 0xbfebfbff<MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2>
cpu0: features 0xbfebfbff<SS,HTT,TM,SBF>
cpu0: features1 0x641d<SSE3,DTES64,MONITOR,DS-CPL,CID,CX16,xTPR>
cpu0: features2 0x20100800<SYSCALL/SYSRET,XD,EM64T>
cpu0: features3 0x1<LAHF>
cpu0: I-cache 12K uOp cache 8-way, D-cache 16KB 64B/line 8-way
cpu0: L2 cache 1MB 64B/line 8-way
cpu0: ITLB 4K/4M: 64 entries
cpu0: DTLB 4K/4M: 64 entries
cpu0: Initial APIC ID 0
cpu0: Cluster/Package ID 0
cpu0: SMT ID 0
cpu0: microcode version 0x3, platform ID 4

booted from "wm0" without problems.  Stress testing building packages did
not hang during my test period (about 12 hours).

The "wm1" interface on this card is not bootable.


Now I will update sources (if_wm.c r1.409+) rebuild and see if the
problem was fixed.

-- 
|/"\ John D. Baker, KN5UKS               NetBSD     Darwin/MacOS X
|\ / jdbaker[snail]mylinuxisp[flyspeck]com    OpenBSD            FreeBSD
| X  No HTML/proprietary data in email.   BSD just sits there and works!
|/ \ GPGkeyID:  D703 4A7E 479F 63F8 D3F4  BD99 9572 8F23 E4AD 1645



Home | Main Index | Thread Index | Old Index