Subject: kern/37011: 4.0_RC1 lock-up
To: kern-bug-people@netbsd.org, gnats-admin@netbsd.org
From: andreas@planix.com
List: netbsd-bugs
Date: 09/20/2007 15:00:01
>Number:         37011
>Category:       kern
>Synopsis:       4.0_RC1 lock-up
>Confidential:   no
>Severity:       serious
>Priority:       high
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Thu Sep 20 15:00:00 +0000 2007
>Originator:     Andreas Wrede
>Release:        NetBSD 4.0_RC1
>Organization:
Andreas Wrede              Planix, Inc.
andreas@planix.com         Networking, System Administration, Consulting
http://www.planix.com      Toronto, Ontario, Canada

"The steady state of disks is full."
                               -- Ken Thompson
>Environment:
System: NetBSD whome.planix.com 4.0_RC1 NetBSD 4.0_RC1 (PLANIX.MPACPI) #279: Mon Sep 17 06:03:14 EDT 2007 root@whome.planix.com:/u2/netbsd-4.0/obj.i386/sys/arch/i386/compile/PLANIX.MPACPI i386

Architecture: i386
Machine: i386
>Description:
	After running for 36 hours on RC1, the machine locked up.

Backtrace:

Stopped at      netbsd:cpu_Debugger+0x4:        leave
db{0}> bt
cpu_Debugger(c1cec060,c1cec060,c1d0b024,c1d0c000,7f8) at netbsd:cpu_Debugger+0x4
comintr(c1cec000,8,10,c1a90030,10) at netbsd:comintr+0x6fa
Xintr_ioapic_edge3() at netbsd:Xintr_ioapic_edge3+0x9c
--- interrupt ---
_kernel_lock(42,0,7,d,c1d06b00) at netbsd:_kernel_lock+0x80
x86_softintlock(10,c1a90030,10,10,c0815de0) at netbsd:x86_softintlock+0xd
DDB lost frame for netbsd:Xsoftserial+0x18, trying 0xcc542e2c
Xsoftserial() at netbsd:Xsoftserial+0x18
--- interrupt ---
Bad frame pointer: 0xc07aace0
0x202:

The kernel was built without LOCKDEBUG or DIAGNOSTIC (this will be corrected).

Boot messages:
NetBSD 4.0_RC1 (PLANIX.MPACPI) #279: Mon Sep 17 06:03:14 EDT 2007
        root@whome.planix.com:/u2/netbsd-4.0/obj.i386/sys/arch/i386/compile/PLANIX.MPACPI
total memory = 1022 MB
rbus: rbus_min_start set to 0x40000000
avail memory = 996 MB
timecounter: Timecounters tick every 1.000 msec
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
BIOS32 rev. 0 found at 0xfd5c0
mainbus0 (root)
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: AMD Opteron or Athlon 64 FX (686-class), 2009.33 MHz, id 0xf5a
cpu0: features e7dbfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu0: features e7dbfbff<PGE,MCA,CMOV,PAT,PSE36,MPC,NOX,MMXX,MMX>
cpu0: features e7dbfbff<FXSR,SSE,SSE2,LONG,3DNOW2,3DNOW>
cpu0: "AMD Opteron(tm) Processor 246"
cpu0: I-cache 64 KB 64B/line 2-way, D-cache 64 KB 64B/line 2-way
cpu0: L2 cache 1 MB 64B/line 16-way
cpu0: ITLB 32 4 KB entries fully associative, 8 4 MB entries fully associative
cpu0: DTLB 32 4 KB entries fully associative, 8 4 MB entries fully associative
cpu0: AMD Power Management features: f<TTP,VID,FID,TS>
cpu0: calibrating local timer
cpu0: apic clock running at 200 MHz
cpu0: 16 page colors
cpu1 at mainbus0: apid 1 (application processor)
cpu1: starting
cpu1: AMD Opteron or Athlon 64 FX (686-class), 2009.27 MHz, id 0xf5a
cpu1: features e7dbfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu1: features e7dbfbff<PGE,MCA,CMOV,PAT,PSE36,MPC,NOX,MMXX,MMX>
cpu1: features e7dbfbff<FXSR,SSE,SSE2,LONG,3DNOW2,3DNOW>
cpu1: "AMD Opteron(tm) Processor 246"
cpu1: I-cache 64 KB 64B/line 2-way, D-cache 64 KB 64B/line 2-way
cpu1: L2 cache 1 MB 64B/line 16-way
cpu1: ITLB 32 4 KB entries fully associative, 8 4 MB entries fully associative
cpu1: DTLB 32 4 KB entries fully associative, 8 4 MB entries fully associative
cpu1: AMD Power Management features: f<TTP,VID,FID,TS>
ioapic0 at mainbus0 apid 2 (I/O APIC)
ioapic0: pa 0xfec00000, version 11, 24 pins
ioapic1 at mainbus0 apid 3 (I/O APIC)
ioapic1: pa 0xdf200000, version 11, 4 pins
ioapic2 at mainbus0 apid 4 (I/O APIC)
ioapic2: pa 0xdf201000, version 11, 4 pins
acpi0 at mainbus0: Advanced Configuration and Power Interface
acpi0: using Intel ACPI CA subsystem version 20060217
acpi0: X/RSDT: OemId <PTLTD ,  RSDT  ,06040000>, AslId < LTP,00000000>
acpi0: SCI interrupting at int 9
acpi0: fixed-feature power button present
timecounter: Timecounter "ACPI-Safe" frequency 3579545 Hz quality 900
ACPI-Safe 24-bit timer
mpacpi: could not get bus number, assuming bus 0
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
acpibut0 at acpi0 (PNP0C0C): ACPI Power Button
PNP0C01 [System Board] at acpi0 not configured
PNP0A03 [PCI/PCI-X Host Bridge] at acpi0 not configured
PNP0C02 [Plug and Play motherboard register resources] at acpi0 not configured
PNP0C02 [Plug and Play motherboard register resources] at acpi0 not configured
PNP0000 [AT Interrupt Controller] at acpi0 not configured
attimer1 at acpi0 (PNP0100): AT Timer
attimer1: io 0x40-0x43 irq 0
PNP0200 [AT DMA Controller] at acpi0 not configured
pcppi1 at acpi0 (PNP0800)
pcppi1: io 0x61
pcppi1: children must have an explicit unit
midi0 at pcppi1: PC speaker (CPU-intensive output)
sysbeep0 at pcppi1
PNP0B00 [AT Real-Time Clock] at acpi0 not configured
npx0 at acpi0 (PNP0C04)
npx0: io 0xf0-0xf1 irq 13
npx0: reported by CPUID; using exception 16
PNP0A05 [Generic Container Device] at acpi0 not configured
pckbc0 at acpi0 (PNP0F13): aux port
pckbc0: irq 12
pckbc1 at acpi0 (PNP0303): kbd port
pckbc1: io 0x60,0x64 irq 1
com0 at acpi0 (PNP0501-1)
com0: io 0x3f8-0x3ff irq 4
com0: ns16550a, working fifo
com1 at acpi0 (PNP0501-2)
com1: io 0x2f8-0x2ff irq 3
com1: ns16550a, working fifo
com1: console
fdc0 at acpi0 (PNP0700-1)
fdc0: io 0x3f0-0x3f5,0x3f7 irq 6 drq 2
lpt0 at acpi0 (PNP0401-2)
lpt0: io 0x378-0x37f,0x778-0x77f irq 7 drq 3
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0A03 [PCI/PCI-X Host Bridge] at acpi0 not configured
pcppi1: attached to attimer1
pckbd0 at pckbc1 (kbd slot)
pckbc1: using irq 1 for kbd slot
wskbd0 at pckbd0 mux 1
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
NVIDIA nForce4 Memory Controller (miscellaneous memory, revision 0xa3) at pci0 dev 0 function 0 not configured
pcib0 at pci0 dev 1 function 0
pcib0: NVIDIA nForce4 PCI-ISA bridge (rev. 0xa3)
NVIDIA nForce4 SMBus (SMBus serial bus, revision 0xa2) at pci0 dev 1 function 1 not configured
ohci0 at pci0 dev 2 function 0: NVIDIA nForce4 USB Host Controller (rev. 0xa2)
LUS0: Picked IRQ 20 with weight 0
ohci0: interrupting at ioapic0 pin 20 (irq 10)
ohci0: OHCI version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
uhub0 at usb0
uhub0: NVIDIA OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 10 ports with 10 removable, self powered
ehci0 at pci0 dev 2 function 1: NVIDIA nForce4 USB2 Host Controller (rev. 0xa3)
LUS2: Picked IRQ 21 with weight 0
ehci0: interrupting at ioapic0 pin 21 (irq 11)
ehci0: BIOS has given up ownership
ehci0: EHCI version 1.0
ehci0: companion controller, 4 ports each: ohci0
usb1 at ehci0: USB revision 2.0
uhub1 at usb1
uhub1: NVIDIA EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub1: 10 ports with 10 removable, self powered
viaide0 at pci0 dev 6 function 0
viaide0: NVIDIA nForce4 IDE Controller (rev. 0xf2)
viaide0: bus-master DMA support present
viaide0: primary channel configured to compatibility mode
viaide0: primary channel ignored (disabled)
viaide0: secondary channel configured to compatibility mode
viaide0: secondary channel interrupting at ioapic0 pin 15 (irq 15)
atabus0 at viaide0 channel 1
viaide1 at pci0 dev 7 function 0
viaide1: NVIDIA nForce4 Serial ATA Controller (rev. 0xf3)
viaide1: bus-master DMA support present
viaide1: primary channel wired to native-PCI mode
LTID: Picked IRQ 22 with weight 0
viaide1: using ioapic0 pin 22 (irq 10) for native-PCI interrupt
atabus1 at viaide1 channel 0
viaide1: secondary channel wired to native-PCI mode
atabus2 at viaide1 channel 1
viaide2 at pci0 dev 8 function 0
viaide2: NVIDIA nForce4 Serial ATA Controller (rev. 0xf3)
viaide2: bus-master DMA support present
viaide2: primary channel wired to native-PCI mode
LSI1: Picked IRQ 23 with weight 0
viaide2: using ioapic0 pin 23 (irq 11) for native-PCI interrupt
atabus3 at viaide2 channel 0
viaide2: secondary channel wired to native-PCI mode
atabus4 at viaide2 channel 1
ppb0 at pci0 dev 9 function 0: NVIDIA nForce4 PCI Host Bridge (rev. 0xa2)
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled
wm0 at pci1 dev 4 function 0: Intel i82541GI 1000BASE-T Ethernet, rev. 0
LNK1: Picked IRQ 16 with weight 0
wm0: interrupting at ioapic0 pin 16 (irq 11)
wm0: 32-bit 33MHz PCI bus
wm0: 64 word (6 address bits) MicroWire EEPROM
wm0: Ethernet address 00:0e:0c:65:e3:a1
igphy0 at wm0 phy 1: Intel IGP01E1000 Gigabit PHY, rev. 0
igphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
vga0 at pci1 dev 6 function 0: ATI Technologies Rage XL (rev. 0x27)
wsdisplay0 at vga0 kbdmux 1
wsmux1: connecting to wsdisplay0
wskbd0: connecting to wsdisplay0
fxp0 at pci1 dev 8 function 0: i82550 Ethernet, rev 16
LNK3: Picked IRQ 17 with weight 0
fxp0: interrupting at ioapic0 pin 17 (irq 10)
fxp0: Ethernet address 00:e0:81:30:d6:0a
inphy0 at fxp0 phy 1: i82555 10/100 media interface, rev. 4
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
ppb1 at pci0 dev 13 function 0: NVIDIA nForce4 PCIe Host Bridge (rev. 0xa3)
pci2 at ppb1 bus 2
pci2: i/o space, memory space enabled, rd/line, wr/inv ok
ppb2 at pci0 dev 14 function 0: NVIDIA nForce4 PCIe Host Bridge (rev. 0xa3)
pci3 at ppb2 bus 3
pci3: i/o space, memory space enabled, rd/line, wr/inv ok
pchb0 at pci0 dev 24 function 0
pchb0: Advanced Micro Devices AMD64 HyperTransport configuration (rev. 0x00)
pchb1 at pci0 dev 24 function 1
pchb1: Advanced Micro Devices AMD64 Address Map configuration (rev. 0x00)
pchb2 at pci0 dev 24 function 2
pchb2: Advanced Micro Devices AMD64 DRAM configuration (rev. 0x00)
pchb3 at pci0 dev 24 function 3
pchb3: Advanced Micro Devices AMD64 Miscellaneous configuration (rev. 0x00)
pchb4 at pci0 dev 25 function 0
pchb4: Advanced Micro Devices AMD64 HyperTransport configuration (rev. 0x00)
pchb5 at pci0 dev 25 function 1
pchb5: Advanced Micro Devices AMD64 Address Map configuration (rev. 0x00)
pchb6 at pci0 dev 25 function 2
pchb6: Advanced Micro Devices AMD64 DRAM configuration (rev. 0x00)
pchb7 at pci0 dev 25 function 3
pchb7: Advanced Micro Devices AMD64 Miscellaneous configuration (rev. 0x00)
isa0 at pcib0
lm0 at isa0 port 0x290-0x297: Winbond W83627HF Hardware monitor
isapnp0 at isa0 port 0x279: ISA Plug 'n Play device support
isapnp0: no ISA Plug 'n Play devices found
pci4 at mainbus0 bus 9
pci4: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pci5 at mainbus0 bus 10
pci5: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
mpt0 at pci5 dev 3 function 0: Symbios Logic FC929
mpt0: interrupting at ioapic2 pin 2 (irq 10)
scsibus0 at mpt0: 255 targets, 8 luns per target
mpt1 at pci5 dev 3 function 1: Symbios Logic FC929
mpt1: interrupting at ioapic2 pin 3 (irq 12)
scsibus1 at mpt1: 255 targets, 8 luns per target
bge0 at pci5 dev 9 function 0: Broadcom BCM5704C Dual Gigabit Ethernet
bge0: interrupting at ioapic2 pin 0 (irq 11)
bge0: firmware handshake timed out, val = 4b657654
bge0: RX CPU self-diagnostics failed!
bge0: chip initialization failed
bge1 at pci5 dev 9 function 1: Broadcom BCM5704C Dual Gigabit Ethernet
bge1: interrupting at ioapic2 pin 1 (irq 12)
bge1: firmware handshake timed out, val = 4b657654
bge1: RX CPU self-diagnostics failed!
bge1: chip initialization failed
ioapic0: enabling
ioapic1: enabling
ioapic2: enabling
timecounter: Timecounter "clockinterrupt" frequency 1000 Hz quality 0
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
Kernelized RAIDframe activated
IPsec: Initialized Security Association Processing.
atapibus0 at atabus0: 2 targets
scsibus0: waiting 2 seconds for devices to settle...
cd0 at atapibus0 drive 0: <HL-DT-STDVD-ROM GDR8164B, , 0L06> cdrom removable
scsibus1: waiting 2 seconds for devices to settle...
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd0(viaide0:1:0): using PIO mode 4, Ultra-DMA mode 2 (Ultra/33) (using DMA)
viaide1 port 0: device present, speed: 3.0Gb/s
viaide2 port 0: device present, speed: 3.0Gb/s
wd0 at atabus1 drive 0: <WDC WD1600JS-22MHB0>
wd0: drive supports 16-sector PIO transfers, LBA48 addressing
wd0: 149 GB, 310101 cyl, 16 head, 63 sec, 512 bytes/sect x 312581808 sectors
wd0: 32-bit data port
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
wd0(viaide1:0:0): using PIO mode 4, Ultra-DMA mode 6 (Ultra/133) (using DMA)
wd1 at atabus3 drive 0: <WDC WD1600JS-00MHB0>
wd1: drive supports 16-sector PIO transfers, LBA48 addressing
wd1: 149 GB, 310101 cyl, 16 head, 63 sec, 512 bytes/sect x 312581808 sectors
wd1: 32-bit data port
wd1: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
wd1(viaide2:0:0): using PIO mode 4, Ultra-DMA mode 6 (Ultra/133) (using DMA)
uhub2 at uhub1 port 3
uhub2: NEC product 0x0050, class 9/0, rev 2.00/1.00, addr 2
uhub2: single transaction translator
uhub2: 7 ports with 7 removable, self powered
sd0 at scsibus0 target 0 lun 0: <APPLE, Xserve RAID, 1.51> disk fixed
sd0: 2794 GB, 357701 cyl, 128 head, 128 sec, 512 bytes/sect x 5860573184 sectors
sd0: mbr partition exceeds disk size
sd0: GPT GUID: c9dc5440-384a-11dc-984c-000e0c65e3a1
dk0 at sd0: sd0-DV
dk0: 2147483582 blocks at 34, type: ffs
dk1 at sd0: sd0-u7
dk1: 2147483616 blocks at 2147483616, type: ffs
dk2 at sd0: sd0-u6
dk2: 1565605919 blocks at 4294967232, type: ffs
sd1 at scsibus1 target 0 lun 0: <APPLE, Xserve RAID, 1.51> disk fixed
sd1: 2794 GB, 357701 cyl, 128 head, 128 sec, 512 bytes/sect x 5860573184 sectors
sd1: mbr partition exceeds disk size
sd1: GPT GUID: 9d99e526-3eeb-11dc-95ad-000e0c65e3a1
dk3 at sd1: sd1-DV
dk3: 2147483582 blocks at 34, type: ffs
dk4 at sd1: sd1-u7
dk4: 2147483616 blocks at 2147483616, type: ffs
dk5 at sd1: sd1-u6
dk5: 1565605919 blocks at 4294967232, type: ffs
uplcom0 at uhub2 port 1
uplcom0: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 3
ucom0 at uplcom0
uplcom1 at uhub2 port 2
uplcom1: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 4
ucom1 at uplcom1
uplcom2 at uhub2 port 3
uplcom2: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 5
ucom2 at uplcom2
uplcom3 at uhub2 port 4
uplcom3: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 6
ucom3 at uplcom3
uplcom4 at uhub2 port 5
uplcom4: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 7
ucom4 at uplcom4
uplcom5 at uhub2 port 6
uplcom5: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 8
ucom5 at uplcom5
uhub3 at uhub2 port 7
uhub3: NEC 2.0 hub, class 9/0, rev 2.00/1.00, addr 9
uhub3: single transaction translator
uhub3: 4 ports with 4 removable, self powered
uplcom6 at uhub3 port 1
uplcom6: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 10
ucom6 at uplcom6
uplcom7 at uhub3 port 2
uplcom7: Prolific Technology Inc. USB-Serial Controller, rev 1.10/3.00, addr 11
ucom7 at uplcom7
raid0: RAID Level 1
raid0: Components: /dev/wd0a /dev/wd1a
raid0: Total Sectors: 312581632 (152627 MB)
boot device: raid0
root on raid0a dumps on raid0b
root file system type: ffs
cpu1: CPU 1 running
Zapata Telephony Interface Registered on major 196
Registered Span 1 ('ZTDUMMY/1') with 0 channels
Span ('ZTDUMMY/1') is new master
ztdummy: init() finished
ztdummy: loaded
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x25, vt100 emulation)
wsdisplay0: screen 4 added (80x25, vt100 emulation)
Registered tone zone 0 (United States / North America)

>How-To-Repeat:
	unknown
>Fix:
	unknown

>Unformatted: