NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

kern/38568: NetBSD 4.99.62 kernel locks up frequently



>Number:         38568
>Category:       kern
>Synopsis:       NetBSD 4.99.62 kernel locks up frequently
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Sat May 03 16:35:00 +0000 2008
>Originator:     Matthias Scheler
>Release:        NetBSD 4.99.62
>Organization:
Matthias Scheler                                  http://zhadum.org.uk/
>Environment:
System: NetBSD lyssa.zhadum.org.uk 4.99.62 NetBSD 4.99.62 (GENERIC) #0: Sat May 
3 13:07:21 BST 2008 tron%lyssa.zhadum.org.uk@localhost:/src/sys/compile/GENERIC 
amd64
Architecture: x86_64
Machine: amd64
>Description:
After I upgrade my HP Proliant ML110 G4 from NetBSD 4.99.51 to 4.99.52 the
machine locks up frequently. I can't provided much debugging information
unfortunately because "reboot -d" didn't work and I can't break into the
debugger because the machine only has a USB keyboard. Here are the
kernel messages:

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
    2006, 2007, 2008
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 4.99.62 (GENERIC) #0: Sat May  3 13:07:21 BST 2008
        tron%lyssa.zhadum.org.uk@localhost:/src/sys/compile/GENERIC
total memory = 5118 MB
avail memory = 4924 MB
timecounter: Timecounters tick every 10.000 msec
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
SMBIOS rev. 2.4 @ 0xdc010 (48 entries)
HP ProLiant ML110 G4 (1.0)
mainbus0 (root)
cpu0 at mainbus0 apid 0: (boot processor)
cpu0: Intel(R) Xeon(R) CPU            3040  @ 1.86GHz, 1862.10 MHz
cpu0: features bffbfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu0: features bffbfbff<PGE,MCA,CMOV,PAT,PSE36,CFLUSH,B20,DS,ACPI,MMX>
cpu0: features bffbfbff<FXSR,SSE,SSE2,SS,HTT,TM,SBF>
cpu0: features2 
e3bd<SSE3,DTES64,MONITOR,DS-CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM>
cpu0: features3 bffbfbff<SYSCALL/SYSRET,XD,EM64T>
cpu0: L2 cache 2 MB 64B/line 8-way
cpu0: Initial APIC ID 0
cpu0: Cluster/Package ID 0
cpu0: Core ID 0
cpu0: calibrating local timer
cpu0: apic clock running at 266 MHz
cpu0: 64 page colors
cpu1 at mainbus0 apid 1: (application processor)
cpu1: Intel(R) Xeon(R) CPU            3040  @ 1.86GHz, 1862.11 MHz
cpu1: features bffbfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu1: features bffbfbff<PGE,MCA,CMOV,PAT,PSE36,CFLUSH,B20,DS,ACPI,MMX>
cpu1: features bffbfbff<FXSR,SSE,SSE2,SS,HTT,TM,SBF>
cpu1: features2 
e3bd<SSE3,DTES64,MONITOR,DS-CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM>
cpu1: features3 bffbfbff<SYSCALL/SYSRET,XD,EM64T>
cpu1: L2 cache 2 MB 64B/line 8-way
cpu1: Initial APIC ID 1
cpu1: Cluster/Package ID 0
cpu1: Core ID 1
ioapic0 at mainbus0 apid 2: pa 0xfec00000, version 20, 24 pins
acpi0 at mainbus0: Advanced Configuration and Power Interface
acpi0: using Intel ACPI CA subsystem version 20080321
acpi0: X/RSDT: OemId <    HP,ML110 G4,06040000>, AslId <FOXC,00000000>
acpi0: SCI interrupting at int 9
acpi0: fixed-feature power button present
timecounter: Timecounter "ACPI-Fast" frequency 3579545 Hz quality 1000
ACPI-Fast 24-bit timer
CPU0 (ACPI Object Type 'Processor' [0x0c]) at acpi0 not configured
CPU1 (ACPI Object Type 'Processor' [0x0c]) at acpi0 not configured
CPU2 (ACPI Object Type 'Processor' [0x0c]) at acpi0 not configured
CPU3 (ACPI Object Type 'Processor' [0x0c]) at acpi0 not configured
MI0 (IPI0001) ACPI Error (nsxfeval-0213): Incorrect return type [Buffer] 
requested [String] [20080321]
at acpi0 not configured
PCI0 (PNP0A03) [PCI/PCI-X Host Bridge] at acpi0 not configured
MBRD (PNP0C02) [Plug and Play motherboard register resources] at acpi0 not 
configured
DMAC (PNP0200) [AT DMA Controller] at acpi0 not configured
MATH (PNP0C04) [Math Coprocessor] at acpi0 not configured
PIC (PNP0000) [AT Interrupt Controller] at acpi0 not configured
RTC (PNP0B00) [AT Real-Time Clock] at acpi0 not configured
pcppi1 at acpi0 (SPKR, PNP0800)
pcppi1: io 0x61
midi0 at pcppi1: PC speaker (CPU-intensive output)
sysbeep0 at pcppi1
attimer1 at acpi0 (TIMR, PNP0100): AT Timer
attimer1: io 0x40-0x43,0x50-0x53 irq 0
LNKA (PNP0C0F) [PCI interrupt link device] at acpi0 not configured
LNKB (PNP0C0F) [PCI interrupt link device] at acpi0 not configured
LNKC (PNP0C0F) [PCI interrupt link device] at acpi0 not configured
LNKD (PNP0C0F) [PCI interrupt link device] at acpi0 not configured
LNKH (PNP0C0F) [PCI interrupt link device] at acpi0 not configured
FWH (INT0800) [Intel FWH Random Number Generator] at acpi0 not configured
SIOD (PNP0A05) [Generic Container Device] at acpi0 not configured
COMA (PNP0501) [16550A-compatible COM port] at acpi0 not configured
COMB (PNP0501) [16550A-compatible COM port] at acpi0 not configured
acpibut0 at acpi0 (PWRB, PNP0C0C): ACPI Power Button
attimer1: attached to pcppi1
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0
pchb0: Intel E7230 Host (rev. 0xc0)
ppb0 at pci0 dev 28 function 0: Intel 82801GB/GR PCI Express Port #1 (rev. 0x01)
pci1 at ppb0 bus 2
pci1: no spaces enabled!
ppb1 at pci0 dev 28 function 4: Intel 82801GB/GR PCI Express Port #5 (rev. 0x01)
pci2 at ppb1 bus 3
pci2: i/o space, memory space enabled
vga0 at pci2 dev 0 function 0: Matrox MGA G200e (ServerEngines) (rev. 0x02)
wsdisplay0 at vga0 kbdmux 1: console (80x25, vt100 emulation)
wsmux1: connecting to wsdisplay0
direct rendering for vga0 unsupported
ppb2 at pci0 dev 28 function 5: Intel 82801GB/GR PCI Express Port #6 (rev. 0x01)
pci3 at ppb2 bus 4
pci3: i/o space, memory space enabled, rd/line, wr/inv ok
bge0 at pci3 dev 0 function 0: Broadcom BCM5721 Gigabit Ethernet
bge0: interrupting at ioapic0 pin 17 (irq 12)
bge0: ASIC unknown BCM575x family (0x4201), Ethernet address 00:1c:c4:5f:0d:9b
bge0: setting short Tx thresholds
brgphy0 at bge0 phy 1: BCM5750 1000BASE-T media interface, rev. 0
brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 
1000baseT-FDX, auto
uhci0 at pci0 dev 29 function 0: Intel 82801GB/GR USB UHCI Controller (rev. 
0x01)
uhci0: interrupting at ioapic0 pin 23 (irq 10)
usb0 at uhci0: USB revision 1.0
uhci1 at pci0 dev 29 function 1: Intel 82801GB/GR USB UHCI Controller (rev. 
0x01)
uhci1: interrupting at ioapic0 pin 19 (irq 5)
usb1 at uhci1: USB revision 1.0
uhci2 at pci0 dev 29 function 2: Intel 82801GB/GR USB UHCI Controller (rev. 
0x01)
uhci2: interrupting at ioapic0 pin 18 (irq 11)
usb2 at uhci2: USB revision 1.0
uhci3 at pci0 dev 29 function 3: Intel 82801GB/GR USB UHCI Controller (rev. 
0x01)
uhci3: interrupting at ioapic0 pin 16 (irq 7)
usb3 at uhci3: USB revision 1.0
ehci0 at pci0 dev 29 function 7: Intel 82801GB/GR USB EHCI Controller (rev. 
0x01)
ehci0: interrupting at ioapic0 pin 23 (irq 10)
ehci0: BIOS refuses to give up ownership, using force
ehci0: EHCI version 1.0
ehci0: companion controllers, 2 ports each: uhci0 uhci1 uhci2 uhci3
usb4 at ehci0: USB revision 2.0
ppb3 at pci0 dev 30 function 0: Intel 82801BA Hub-PCI Bridge (rev. 0xe1)
pci4 at ppb3 bus 10
pci4: i/o space, memory space enabled
pcib0 at pci0 dev 31 function 0
pcib0: Intel 82801GB/GR LPC Interface Bridge (rev. 0x01)
piixide0 at pci0 dev 31 function 1
piixide0: Intel 82801GB/GR IDE Controller (ICH7) (rev. 0x01)
piixide0: bus-master DMA support present
piixide0: primary channel configured to compatibility mode
piixide0: primary channel interrupting at ioapic0 pin 14 (irq 14)
atabus0 at piixide0 channel 0
piixide0: secondary channel configured to compatibility mode
piixide0: secondary channel interrupting at ioapic0 pin 15 (irq 15)
atabus1 at piixide0 channel 1
piixide1 at pci0 dev 31 function 2
piixide1: Intel 82801GB/GR Serial ATA/Raid Controller (ICH7) (rev. 0x01)
piixide1: bus-master DMA support present
piixide1: primary channel configured to native-PCI mode
piixide1: using ioapic0 pin 19 (irq 5) for native-PCI interrupt
atabus2 at piixide1 channel 0
piixide1: secondary channel configured to native-PCI mode
atabus3 at piixide1 channel 1
isa0 at pcib0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
atapibus0 at atabus0: 2 targets
uhub0 at usb0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhub1 at usb1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhub2 at usb2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhub3 at usb3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
uhub4 at usb4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub4: 8 ports with 8 removable, self powered
cd0 at atapibus0 drive 0: <TSSTcorp CDW/DVD TS-H492C, , TB01> cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd0(piixide0:0:0): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA)
uhub5 at uhub0 port 1: Texas Instruments TUSB2046 hub, class 9/0, rev 
1.10/1.25, addr 2
uhub5: 4 ports with 4 removable, self powered
wd0 at atabus2 drive 0: <FB160C4081>
wd0: drive supports 16-sector PIO transfers, LBA48 addressing
wd0: 149 GB, 310101 cyl, 16 head, 63 sec, 512 bytes/sect x 312581808 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 5 (Ultra/100)
wd0(piixide1:0:0): using PIO mode 4, Ultra-DMA mode 5 (Ultra/100) (using DMA)
Kernelized RAIDframe activated
pad0: outputs: 44100Hz, 16-bit, stereo
audio0 at pad0: half duplex
uhidev0 at uhub3 port 1 configuration 1 interface 0
uhidev0: ServerEngines SE USB Device, rev 1.10/0.01, addr 2, iclass 3/1
ukbd0 at uhidev0
boot device: wd0
root on wd0a dumps on wd0b
root file system type: ffs
wskbd0 at ukbd0: console keyboard, using wsdisplay0
uhidev1 at uhub3 port 1 configuration 1 interface 1uhidev2 at uhub5 port 2 
configuration 1 interface 0
uhidev1: ServerEngines SE USB Device, rev 1.10/0.01, addr 2, iclass 3/1
ums0 at uhidev1: 8 buttons and Z dir.
wsmouse0 at ums0 mux 0

uhidev2: USB KVM Switch USB KVM Switch, rev 1.10/1.00, addr 3, iclass 3/1
ukbd1 at uhidev2
wskbd1 at ukbd1 mux 1
wskbd1: connecting to wsdisplay0
uhidev3 at uhub5 port 2 configuration 1 interface 1
uhidev3: vendor 0x06f2 product 0x0011, rev 1.10/1.00, addr 3, iclass 3/1
ums1 at uhidev3: 5 buttons and Z dir.
wsmouse1 at ums1 mux 0
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x25, vt100 emulation)
wsdisplay0: screen 4 added (80x25, vt100 emulation)
nfs server colwyn:/export/home: not responding
nfs server colwyn:/export/home: is alive again

The NFS errors are a bit suspicious because the NFS server was running
all the time and is connected to the machine via a 1Gb/s ethernet switch.
I'm using NIS, amd(8) and an NFS mounted home directory on the machine.
I've seen lock-ups before but could never reproduce them.

The 4.99.51 kernel worked stable enough to build the new userland and
kernel with "-j 4".

>How-To-Repeat:
It happended twice while I tried to build packages and again when I tried
to submit a PR on the affected machine.

>Fix:
None provided.



Home | Main Index | Thread Index | Old Index