Port-amd64 archive


HP Proliant DL360 gen9 crashes when booted with ACPI



I have an HP ProLiant DL360 Gen9 that works fine under 7.0_STABLE when booted without ACPI, except that only one processor core is visible. When I boot with ACPI enabled, it crashes just after the SCSI probes begin [1].

If I attempt the same thing in today's -current, it fails a diagnostic assertion in pmap.c at approximately the same point in the boot process [2].

Aside from getting a serial console hooked up so I have some hope of interacting with DDB or getting complete boot messages, is there anything else I might try to gather more information about this problem? I've included full dmesg output from the non-ACPI case of 7.0_STABLE below [3].

I'm running a custom kernel in [3], but the kernel config simply disables ACPI and a few compat options and devices we don't use [4]. It fails in much the same way when running GENERIC.

Thanks,

-Konrad

--

[1]: 7.0_STABLE boot failure.  Transcribed from a photo:

...
vendor 0x8086 product 0x6f98 (miscellaneous system, revision 0x01) at pci16 dev 30 function 0 not configured
vendor 0x8086 product 0x6f99 (miscellaneous system, revision 0x01) at pci16 dev 30 function 1 not configured
vendor 0x8086 product 0x6f9a (miscellaneous system, revision 0x01) at pci16 dev 30 function 2 not configured
vendor 0x8086 product 0x6fc0 (miscellaneous system, revision 0x01) at pci16 dev 30 function 3 not configured
vendor 0x8086 product 0x6f9c (miscellaneous system, revision 0x01) at pci16 dev 30 function 4 not configured
vendor 0x8086 product 0x6f88 (miscellaneous system, revision 0x01) at pci16 dev 31 function 0 not configured
vendor 0x8086 product 0x6f8a (miscellaneous system, revision 0x01) at pci16 dev 31 function 2 not configured
scsibus0: waiting 2 seconds for devices to settle...
uvm_fault(0xffffffff81054240, 0x0, 4) -> e
uvm_fault(0xffffffff81054240, 0x0, 1) -> e
fatal page fault in supervisor mode
trap type 6 code 0 rip ffffffff808acaba cs 8 rflags 10246 cr2 0 ilevel 0 rsp fffffe811cf2bd18
curlwp 0xfffffe847ef3f840 pid 0.2 lowest kstack 0xfffffe811cf292c0
kernel: page fault trap, code=0
Stopped in pid 0.2 (system) at  netbsd:trap+0x91f:      movzwl  0(%rax),%eax
db{0}>

[2]: today's -current:

panic: kernel diagnostic assertion "ci->ci_tlbstate != TLBSTATE_VALID" failed: file "/var/src-current/sys/arch/x86/x86/pmap.c", line 2624
fatal breakpoint trap in supervisor mode
trap type 1 code 0 rip ffffffff80215455 cs 8 rflags 246 cr2 0 ilevel 7 rsp fffffe811cf1ee00
curlwp 0xfffffe847ef33020 pid 0.7 lowest kstack 0xfffffe811cf1b2c0
Stopped in pid 0.7 (system) at  netbsd:breakpoint+0x5:  leave
db{0}>
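
As a reading aid for comparing the two crashes, the "trap type" lines in [1] and [2] can be split into named fields. This is a minimal sketch, not part of any NetBSD tooling: the function name parse_trap_line is hypothetical, and it assumes the trap type is printed in decimal and every other field in hexadecimal.

```python
import re

# Field names in the order they appear in the trap lines quoted above.
FIELDS = ("type", "code", "rip", "cs", "rflags", "cr2", "ilevel", "rsp")


def parse_trap_line(line):
    """Return a dict mapping each field name in a 'trap type ...' line to an int."""
    pattern = r"trap\s+" + r"\s+".join(rf"{name}\s+(\w+)" for name in FIELDS)
    match = re.search(pattern, line)
    if match is None:
        raise ValueError("not a trap line: " + line)
    values = {}
    for name, text in zip(FIELDS, match.groups()):
        # Assumption: 'type' is decimal, all other fields are hexadecimal.
        base = 10 if name == "type" else 16
        values[name] = int(text, base)
    return values


stable = parse_trap_line(
    "trap type 6 code 0 rip ffffffff808acaba cs 8 rflags 10246 "
    "cr2 0 ilevel 0 rsp fffffe811cf2bd18"
)
current = parse_trap_line(
    "trap type 1 code 0 rip ffffffff80215455 cs 8 rflags 246 "
    "cr2 0 ilevel 7 rsp fffffe811cf1ee00"
)

# List the fields whose values differ between the two crashes.
for name in FIELDS:
    if stable[name] != current[name]:
        print(f"{name}: {stable[name]:#x} vs {current[name]:#x}")
```

The loop prints only the fields that differ, which makes it easier to see that the two failures stop at different instruction addresses and interrupt levels even though both report cr2 as 0.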

[3]: complete dmesg.boot when booted without ACPI:

NetBSD watchtower.wnf.uw.edu 7.0_STABLE NetBSD 7.0_STABLE (HP-WATCHTOWER) #2: Mon Dec 12 15:30:36 PST 2016 root%watchtower.coral.washington.edu@localhost:/ovar/src-7/sys/arch/amd64/compile/obj/HP-WATCHTOWER amd64
watchtower# cat /var/run/dmesg.boot
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
    2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 7.0_STABLE (HP-WATCHTOWER) #2: Mon Dec 12 15:30:36 PST 2016
	root%watchtower.coral.washington.edu@localhost:/ovar/src-7/sys/arch/amd64/compile/obj/HP-WATCHTOWER
total memory = 16258 MB
avail memory = 15769 MB
kern.module.path=/stand/amd64/7.0/modules
timecounter: Timecounters tick every 10.000 msec
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
HP ProLiant DL360 Gen9
mainbus0 (root)
cpu0 at mainbus0: Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz, id 0x406f1
ipmi0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0: vendor 0x8086 product 0x6f00 (rev. 0x01)
ppb0 at pci0 dev 1 function 0: vendor 0x8086 product 0x6f02 (rev. 0x01)
ppb0: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x8 @ 8.0GT/s
pci1 at ppb0 bus 3
pci1: i/o space, memory space enabled, rd/line, wr/inv ok
ciss0 at pci1 dev 0 function 0: HP Smart Array 10
ciss0: interrupting at irq 11
ciss0: 1 LD, HW rev 1, FW 4.02/4.02, 64bit fifo rro
scsibus0 at ciss0: 1 target, 1 lun per target
ppb1 at pci0 dev 1 function 1: vendor 0x8086 product 0x6f03 (rev. 0x01)
ppb1: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x4 @ 8.0GT/s
pci2 at ppb1 bus 12
pci2: i/o space, memory space enabled, rd/line, wr/inv ok
ppb2 at pci0 dev 2 function 0: vendor 0x8086 product 0x6f04 (rev. 0x01)
ppb2: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x8 @ 8.0GT/s
pci3 at ppb2 bus 5
pci3: i/o space, memory space enabled, rd/line, wr/inv ok
ppb3 at pci0 dev 2 function 1: vendor 0x8086 product 0x6f05 (rev. 0x01)
ppb3: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x4 @ 8.0GT/s
pci4 at ppb3 bus 13
pci4: i/o space, memory space enabled, rd/line, wr/inv ok
ppb4 at pci0 dev 2 function 2: vendor 0x8086 product 0x6f06 (rev. 0x01)
ppb4: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x8 @ 8.0GT/s
pci5 at ppb4 bus 4
pci5: i/o space, memory space enabled, rd/line, wr/inv ok
ppb5 at pci0 dev 2 function 3: vendor 0x8086 product 0x6f07 (rev. 0x01)
ppb5: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x4 @ 8.0GT/s
pci6 at ppb5 bus 14
pci6: i/o space, memory space enabled, rd/line, wr/inv ok
ppb6 at pci0 dev 3 function 0: vendor 0x8086 product 0x6f08 (rev. 0x01)
ppb6: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x16 @ 8.0GT/s
pci7 at ppb6 bus 8
pci7: i/o space, memory space enabled, rd/line, wr/inv ok
ppb7 at pci0 dev 3 function 1: vendor 0x8086 product 0x6f09 (rev. 0x01)
ppb7: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x4 @ 8.0GT/s
pci8 at ppb7 bus 15
pci8: i/o space, memory space enabled, rd/line, wr/inv ok
ppb8 at pci0 dev 3 function 2: vendor 0x8086 product 0x6f0a (rev. 0x01)
ppb8: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x4 @ 8.0GT/s
pci9 at ppb8 bus 11
pci9: i/o space, memory space enabled, rd/line, wr/inv ok
ppb9 at pci0 dev 3 function 3: vendor 0x8086 product 0x6f0b (rev. 0x01)
ppb9: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x4 @ 8.0GT/s
pci10 at ppb9 bus 16
pci10: i/o space, memory space enabled, rd/line, wr/inv ok
vendor 0x8086 product 0x6f20 (miscellaneous system, revision 0x01) at pci0 dev 4 function 0 not configured
vendor 0x8086 product 0x6f21 (miscellaneous system, revision 0x01) at pci0 dev 4 function 1 not configured
vendor 0x8086 product 0x6f22 (miscellaneous system, revision 0x01) at pci0 dev 4 function 2 not configured
vendor 0x8086 product 0x6f23 (miscellaneous system, revision 0x01) at pci0 dev 4 function 3 not configured
vendor 0x8086 product 0x6f24 (miscellaneous system, revision 0x01) at pci0 dev 4 function 4 not configured
vendor 0x8086 product 0x6f25 (miscellaneous system, revision 0x01) at pci0 dev 4 function 5 not configured
vendor 0x8086 product 0x6f26 (miscellaneous system, revision 0x01) at pci0 dev 4 function 6 not configured
vendor 0x8086 product 0x6f27 (miscellaneous system, revision 0x01) at pci0 dev 4 function 7 not configured
vendor 0x8086 product 0x6f28 (miscellaneous system, revision 0x01) at pci0 dev 5 function 0 not configured
vendor 0x8086 product 0x6f29 (miscellaneous system, revision 0x01) at pci0 dev 5 function 1 not configured
vendor 0x8086 product 0x6f2a (miscellaneous system, revision 0x01) at pci0 dev 5 function 2 not configured
vendor 0x8086 product 0x6f2c (interrupt system, IO(x) APIC, revision 0x01) at pci0 dev 5 function 4 not configured
vendor 0x8086 product 0x8d7c (undefined, subclass 0x00, revision 0x05) at pci0 dev 17 function 0 not configured
vendor 0x8086 product 0x8d31 (USB serial bus, xHCI, revision 0x05) at pci0 dev 20 function 0 not configured
ehci0 at pci0 dev 26 function 0: vendor 0x8086 product 0x8d2d (rev. 0x05)
ehci0: interrupting at irq 9
ehci0: BIOS has given up ownership
ehci0: EHCI version 1.0
usb0 at ehci0: USB revision 2.0
ppb10 at pci0 dev 28 function 0: vendor 0x8086 product 0x8d10 (rev. 0xd5)
ppb10: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x2 @ 5.0GT/s
pci11 at ppb10 bus 17
pci11: i/o space, memory space enabled, rd/line, wr/inv ok
ppb11 at pci0 dev 28 function 2: vendor 0x8086 product 0x8d14 (rev. 0xd5)
ppb11: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x2 @ 5.0GT/s
ppb11: link is x1 @ 2.5GT/s
pci12 at ppb11 bus 1
pci12: i/o space, memory space enabled, rd/line, wr/inv ok
vendor 0x103c product 0x3306 (miscellaneous system, revision 0x06) at pci12 dev 0 function 0 not configured
vga0 at pci12 dev 0 function 1: vendor 0x102b product 0x0533 (rev. 0x01)
wsdisplay0 at vga0 kbdmux 1: console (80x25, vt100 emulation)
wsmux1: connecting to wsdisplay0
drm at vga0 not configured
vendor 0x103c product 0x3307 (miscellaneous system, revision 0x06) at pci12 dev 0 function 2 not configured
uhci0 at pci12 dev 0 function 4: vendor 0x103c product 0x3300 (rev. 0x03)
uhci0: interrupting at irq 5
usb1 at uhci0: USB revision 1.0
ppb12 at pci0 dev 28 function 4: vendor 0x8086 product 0x8d18 (rev. 0xd5)
ppb12: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x2 @ 5.0GT/s
pci13 at ppb12 bus 2
pci13: i/o space, memory space enabled, rd/line, wr/inv ok
bge0 at pci13 dev 0 function 0: Broadcom BCM5719 Gigabit Ethernet
bge0: interrupting at irq 11
bge0: APE firmware NCSI 1.3.12.0
bge0: HW config 002b11d4, 00006014, 0000aa38, 00000000 00000000
bge0: ASIC unknown BCM5719 (0x5719001), Ethernet address 94:18:82:09:ff:88
bge0: setting short Tx thresholds
brgphy0 at bge0 phy 1: BCM5719C 1000BASE-T media interface, rev. 0
brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
bge1 at pci13 dev 0 function 1: Broadcom BCM5719 Gigabit Ethernet
bge1: interrupting at irq 10
bge1: APE firmware NCSI 1.3.12.0
bge1: HW config 002b11d4, 00006014, 0000aa38, 00000000 00000000
bge1: ASIC unknown BCM5719 (0x5719001), Ethernet address 94:18:82:09:ff:89
bge1: setting short Tx thresholds
brgphy1 at bge1 phy 2: BCM5719C 1000BASE-T media interface, rev. 0
brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
bge2 at pci13 dev 0 function 2: Broadcom BCM5719 Gigabit Ethernet
bge2: interrupting at irq 11
bge2: APE firmware NCSI 1.3.12.0
bge2: HW config 002b11d4, 00006014, 0000aa38, 00000000 00000000
bge2: ASIC unknown BCM5719 (0x5719001), Ethernet address 94:18:82:09:ff:8a
bge2: setting short Tx thresholds
brgphy2 at bge2 phy 3: BCM5719C 1000BASE-T media interface, rev. 0
brgphy2: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
bge3 at pci13 dev 0 function 3: Broadcom BCM5719 Gigabit Ethernet
bge3: interrupting at irq 10
bge3: APE firmware NCSI 1.3.12.0
bge3: HW config 002b11d4, 00006014, 0000aa38, 00000000 00000000
bge3: ASIC unknown BCM5719 (0x5719001), Ethernet address 94:18:82:09:ff:8b
bge3: setting short Tx thresholds
brgphy3 at bge3 phy 4: BCM5719C 1000BASE-T media interface, rev. 0
brgphy3: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ppb13 at pci0 dev 28 function 6: vendor 0x8086 product 0x8d1c (rev. 0xd5)
ppb13: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 5.0GT/s
pci14 at ppb13 bus 18
pci14: i/o space, memory space enabled, rd/line, wr/inv ok
ppb14 at pci0 dev 28 function 7: vendor 0x8086 product 0x8d1e (rev. 0xd5)
ppb14: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 5.0GT/s
pci15 at ppb14 bus 19
pci15: i/o space, memory space enabled, rd/line, wr/inv ok
ehci1 at pci0 dev 29 function 0: vendor 0x8086 product 0x8d26 (rev. 0x05)
ehci1: interrupting at irq 9
ehci1: BIOS has given up ownership
ehci1: EHCI version 1.0
usb2 at ehci1: USB revision 2.0
ichlpcib0 at pci0 dev 31 function 0: vendor 0x8086 product 0x8d44 (rev. 0x05)
timecounter: Timecounter "ichlpcib0" frequency 3579545 Hz quality 1000
ichlpcib0: 24-bit timer
ichlpcib0: TCO timer reboot disabled by hardware; hope SMBIOS properly handles it.
ichlpcib0: TCO (watchdog) timer configured.
ichsmb0 at pci0 dev 31 function 3: vendor 0x8086 product 0x8d22 (rev. 0x05)
ichsmb0: interrupting at irq 9
iic0 at ichsmb0: I2C bus
isa0 at ichlpcib0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
pckbc0 at isa0 port 0x60-0x64
attimer0 at isa0 port 0x40-0x43
pcppi0 at isa0 port 0x61
midi0 at pcppi0: PC speaker
sysbeep0 at pcppi0
attimer0: attached to pcppi0
coretemp0 at cpu0: thermal sensor, 1 C resolution, Tjmax=90
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
timecounter: Timecounter "TSC" frequency 2097662070 Hz quality 3000
scsibus0: waiting 2 seconds for devices to settle...
IPsec: Initialized Security Association Processing.
uhub0 at usb0: vendor 0x8086 EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhub1 at usb1: vendor 0x103c UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhub2 at usb2: vendor 0x8086 EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
sd0 at scsibus0 target 0 lun 0: <HP, LOGICAL VOLUME, 4.02> disk fixed
sd0: 7451 GB, 65535 cyl, 255 head, 935 sec, 512 bytes/sect x 15627854000 sectors
sd0: GPT GUID: 7316c01f-456c-405d-93be-d76edecc9054
dk0 at sd0: 139299df-7615-4032-87ac-7c350701f04b
dk0: 8389584 blocks at 34, type: ffs
dk1 at sd0: adbc6c35-521e-460f-9bf4-23ee40943223
dk1: 33299280 blocks at 8389618, type: swap
dk2 at sd0: 1bdd8372-e8d8-497e-bb63-84868262b8a2
dk2: 16778160 blocks at 41688898, type: ffs
dk3 at sd0: 735212e7-ef62-4efc-bab4-339330a9b1ca
dk3: 15569386909 blocks at 58467058, type: ffs
sd0: async, 8-bit transfers, tagged queueing
uhub3 at uhub2 port 1: vendor 0x8087 product 0x8002, class 9/0, rev 2.00/0.05, addr 2
uhub3: single transaction translator
uhub4 at uhub0 port 1: vendor 0x8087 product 0x800a, class 9/0, rev 2.00/0.05, addr 2
uhub4: single transaction translator
uhub3: 8 ports with 8 removable, self powered
uhub4: 6 ports with 6 removable, self powered
uhub5 at uhub3 port 3: vendor 0x0424 product 0x2660, class 9/0, rev 2.00/8.01, addr 3
uhub5: single transaction translator
uhub5: 2 ports with 1 removable, self powered
uhidev0 at uhub4 port 5 configuration 1 interface 0
uhidev0: Avocent HP 336047-B21, rev 1.10/1.00, addr 3, iclass 3/1
ukbd0 at uhidev0: 8 modifier keys, 6 key codes
wskbd0 at ukbd0: console keyboard, using wsdisplay0
uhidev1 at uhub4 port 5 configuration 1 interface 1
uhidev1: Avocent HP 336047-B21, rev 1.10/1.00, addr 3, iclass 3/1
uhidev1: 3 report ids
ums0 at uhidev1 reportid 1: 5 buttons and Z dir
wsmouse0 at ums0 mux 0
uhid0 at uhidev1 reportid 2: input=2, output=0, feature=0
uhid1 at uhidev1 reportid 3: input=1, output=0, feature=0
uhidev2 at uhub4 port 6 configuration 1 interface 0
uhidev2: vendor 0x413c Dell KB216 Wired Keyboard, rev 2.00/1.10, addr 4, iclass 3/1
ukbd1 at uhidev2: 8 modifier keys, 6 key codes
wskbd1 at ukbd1 mux 1
wskbd1: connecting to wsdisplay0
uhidev3 at uhub4 port 6 configuration 1 interface 1
uhidev3: vendor 0x413c Dell KB216 Wired Keyboard, rev 2.00/1.10, addr 4, iclass 3/0
uhidev3: 2 report ids
uhid2 at uhidev3 reportid 1: input=1, output=0, feature=0
uhid3 at uhidev3 reportid 2: input=2, output=0, feature=0
pad0: outputs: 44100Hz, 16-bit, stereo
audio0 at pad0: half duplex, playback, capture
boot device: sd0
root on dk0 dumps on dk1
root file system type: ffs
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x25, vt100 emulation)
wsdisplay0: screen 4 added (80x25, vt100 emulation)

[4] Kernel config file:

include         "arch/amd64/conf/GENERIC"

# Has no effect, bummer...
#options        NMBCLUSTERS=32768

options         DIAGNOSTIC      # expensive kernel consistency checks

# Compatibility
#options        COMPAT_60       # NetBSD 6.0
#options        COMPAT_50       # NetBSD 5.0
no options      COMPAT_40       # NetBSD 4.0
no options      COMPAT_30       # NetBSD 3.0
no options      COMPAT_20       # NetBSD 2.0
no options      COMPAT_16       # NetBSD 1.6
no options      COMPAT_15       # NetBSD 1.5

# Other OS compat
no options      COMPAT_OSSAUDIO # OSS (Voxware) audio driver compatibility
no options      COMPAT_LINUX    # binary compatibility with Linux

# File systems
no file-system  EXT2FS          # second extended file system (linux)
no file-system  LFS             # log-structured file system
no file-system  NULLFS          # loopback file system
no file-system  OVERLAY         # overlay file system
no file-system  UMAPFS          # NULLFS + uid and gid remapping
no file-system  UNION           # union file system
no file-system  CODA            # Coda File System; also needs vcoda (below)
no file-system  SMBFS           # experimental - CIFS; also needs nsmb (below)

# File system options
no options      QUOTA           # legacy UFS quotas
no options      QUOTA2          # new, in-filesystem UFS quotas

# Networking options
options         GATEWAY         # packet forwarding
no options      INET6           # IPv6
#options        IPSEC           # IP security
#options        IPSEC_ESP       # IP security (encryption part; define w/IPSEC)
no options      PPP_FILTER      # Active filter support for PPP (requires bpf)

# Remove unused devices
no acpi0           at mainbus0
no acpicpu*        at cpu?

# disk/mass storage pseudo-devices
no pseudo-device        ccd
no pseudo-device        cgd
no pseudo-device        raid
no options      RAID_AUTOCONFIG
no pseudo-device        md
no options      MEMORY_DISK_HOOKS
no options      MEMORY_DISK_DYNAMIC
no pseudo-device        ppp
no pseudo-device        pppoe
no pseudo-device        sl
no pseudo-device        strip
no pseudo-device        irframetty
no pseudo-device        stf
no pseudo-device        bridge
pseudo-device   pf
pseudo-device   pflog
no pseudo-device        sequencer
no pseudo-device        vcoda
no pseudo-device        nsmb

# Veriexec
no options      FILEASSOC
no pseudo-device        veriexec


--
-----------------------------------------------------------------------
Konrad Schroder              Box 352143 -or- 135 Fluke Hall, Mason Road
Software Developer and Sys Admin               University of Washington
College of Engineering                          Seattle, WA, 98195, USA

