Subject: Re: Dell PowerEdge 1550 hangs at boot.
To: None <tls@rek.tjls.com>
From: Peter Eisch <peter@boku.net>
List: port-i386
Date: 03/16/2006 21:11:01
On 3/16/06 3:48 PM, "Thor Lancelot Simon" <tls@rek.tjls.com> wrote:

> On Thu, Mar 16, 2006 at 03:28:17PM -0600, Peter Eisch wrote:
> 
>> ppb0 at pci1 dev 2 function 0: Intel i960 RM PCI-PCI (rev. 0x02)
>> pci2 at ppb0 bus 5
>> pci2: i/o space, memory space enabled, rd/line, wr/inv ok
>> ahc1 at pci2 dev 4 function 0: Adaptec aic7899 Ultra160 SCSI adapter
>> ahc1: interrupting at irq 5
>> ahc1: aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs
>> scsibus0 at ahc1: 16 targets, 8 luns per target
>> ahc2 at pci2 dev 4 function 1: Adaptec aic7899 Ultra160 SCSI adapter
>> ahc2: interrupting at irq 11
>> ahc2: aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs
>> scsibus1 at ahc2: 16 targets, 8 luns per target
> 
> What card is this?  This looks *very* wrong.  That "i960 RM PCI-PCI"
> is not really a bus bridge, it's a CPU, generally used on RAID adapters,
> that has an integral PCI bridge.  This looks like a RAID card whose
> BIOS has failed to run for some reason, leaving it uninitialized.
> 

This is a Dell 2550 with dual 1.13GHz PIIIs and the PERC 3/DC RAID controller.
The only non-standard piece in this system is the fxp1 which, like the PERC,
is on the 3-slot riser card.  Otherwise it's a typical 2550.

> Also, I find the double-probe of the 21154 bridge used on the
> MegaRAID adapter extremely odd.  I think there is something deeply
> wrong between this host's BIOS and our PCI code.  Can you try an ACPI
> kernel?
> 

With the ACPI.MP kernel it probes cd0 and then, at the point where it should
mount root, it just sits indefinitely.  This differs from before, when it
would eventually pop to life.  The 'tr' in ddb here (after the interrupt
processing stack) is simply:

netbsd:mpidle:


Booting from a 2.0 install, a 2.0 boot and a 3.0 install all showed basically
the same as below, except that the isp0 probing includes this line:

 isp0: failed to set active negation state (1,1), (1,1)

In the BIOS I disabled the integrated two-channel Adaptec SCSI, and then it
would boot the ACPI kernel:

diablo> dmesg
NetBSD 3.0_STABLE (ACPI) #0: Thu Mar 16 16:15:20 CST 2006
        
peter@buster:/builds/netbsd-3/i386/obj/builds/netbsd-3/src/sys/arch/i386/com
pile/ACPI
total memory = 2047 MB
avail memory = 1996 MB
BIOS32 rev. 0 found at 0xffe90
mainbus0 (root)
cpu0 at mainbus0: apid 1 (boot processor)
cpu0: Intel Pentium III (686-class), 1130.55 MHz, id 0x6b1
cpu0: features 383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu0: features 383fbff<PGE,MCA,CMOV,PAT,PSE36,MMX>
cpu0: features 383fbff<FXSR,SSE>
cpu0: "Intel(R) Pentium(R) III CPU family      1133MHz"
cpu0: I-cache 16 KB 32B/line 4-way, D-cache 16 KB 32B/line 4-way
cpu0: L2 cache 512 KB 32B/line 8-way
cpu0: ITLB 32 4 KB entries 4-way, 2 4 MB entries fully associative
cpu0: DTLB 64 4 KB entries 4-way, 8 4 MB entries 4-way
cpu0: calibrating local timer
cpu0: apic clock running at 132 MHz
cpu0: 16 page colors
cpu1 at mainbus0: apid 0 (application processor)
cpu1: starting
cpu1: Intel Pentium III (686-class), 1130.45 MHz, id 0x6b1
cpu1: features 383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR>
cpu1: features 383fbff<PGE,MCA,CMOV,PAT,PSE36,MMX>
cpu1: features 383fbff<FXSR,SSE>
cpu1: "Intel(R) Pentium(R) III CPU family      1133MHz"
cpu1: I-cache 16 KB 32B/line 4-way, D-cache 16 KB 32B/line 4-way
cpu1: L2 cache 512 KB 32B/line 8-way
cpu1: ITLB 32 4 KB entries 4-way, 2 4 MB entries fully associative
cpu1: DTLB 64 4 KB entries 4-way, 8 4 MB entries 4-way
ioapic0 at mainbus0 apid 2 (I/O APIC)
ioapic0: pa 0xfec00000, version 11, 16 pins
ioapic0: misconfigured as apic 0
ioapic0: remapped to apic 2
ioapic1 at mainbus0 apid 3 (I/O APIC)
ioapic1: pa 0xfec01000, version 11, 16 pins
ioapic1: misconfigured as apic 0
ioapic1: remapped to apic 3
acpi0 at mainbus0
acpi0: using Intel ACPI CA subsystem version 20040211
acpi0: X/RSDT: OemId <DELL  ,PE2550  ,00000001>, AslId <MSFT,0100000a>
acpi0: SCI interrupting at int 9
acpi0: fixed-feature power button present
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
ACPI Object Type 'Processor' (0x0c) at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0C0F [PCI interrupt link device] at acpi0 not configured
PNP0A03 [PCI Bus] at acpi0 not configured
PNP0200 [AT DMA Controller] at acpi0 not configured
npx0 at acpi0 (PNP0C04)
npx0: io 0xf0-0xff irq 13
npx0: using exception 16
PNP0000 [AT Interrupt Controller] at acpi0 not configured
PNP0800 [AT-style speaker sound] at acpi0 not configured
PNP0100 [AT Timer] at acpi0 not configured
fdc0 at acpi0 (PNP0700)
fdc0: io 0x3f0-0x3f5,0x3f7 irq 6 drq 2
pckbc0 at acpi0 (PNP0303): kbd port
pckbc0: io 0x60,0x64 irq 1
pckbc1 at acpi0 (PNP0F13): aux port
pckbc1: irq 12
com0 at acpi0 (PNP0501-1)
com0: io 0x3f8-0x3ff irq 4
com0: ns16550a, working fifo
com1 at acpi0 (PNP0501-2)
com1: io 0x2f8-0x2ff irq 3
com1: ns16550a, working fifo
lpt0 at acpi0 (PNP0401)
lpt0: io 0x378-0x37f,0x778-0x77f irq 7 drq 1
PNP0B00 [AT Real-Time Clock] at acpi0 not configured
PNP0C01 [System Board] at acpi0 not configured
PNP0C01 [System Board] at acpi0 not configured
PNP0A03 [PCI Bus] at acpi0 not configured
PNP0A03 [PCI Bus] at acpi0 not configured
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard
pms0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0
pchb0: ServerWorks CNB20-HE PCI bridge (rev. 0x23)
pchb1 at pci0 dev 0 function 1
pchb1: ServerWorks CNB20-HE PCI bridge (rev. 0x01)
pci1 at pchb1 bus 4
pci1: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
ppb0 at pci1 dev 2 function 0: Intel i960 RM PCI-PCI (rev. 0x02)
pci2 at ppb0 bus 5
pci2: i/o space, memory space enabled, rd/line, wr/inv ok
fxp0 at pci1 dev 4 function 0: i82559 Ethernet, rev 8
fxp0: interrupting at ioapic1 pin 0 (irq 11)
fxp0: May need receiver lock-up workaround
fxp0: Ethernet address 00:06:5b:04:a5:8f
inphy0 at fxp0 phy 1: i82555 10/100 media interface, rev. 4
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pchb2 at pci0 dev 0 function 2
pchb2: ServerWorks CNB30-LE PCI bridge (rev. 0x01)
pchb3 at pci0 dev 0 function 3
pchb3: ServerWorks CNB30-LE PCI bridge (rev. 0x01)
pci3 at pchb3 bus 3
pci3: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
bge0 at pci3 dev 8 function 0: Broadcom BCM5700 Gigabit Ethernet
bge0: interrupting at ioapic1 pin 1 (irq 5)
bge0: ASIC BCM5700 B1 (0x7102), Ethernet address 00:06:5b:04:a5:90
brgphy0 at bge0 phy 1: BCM5401 1000BASE-T media interface, rev. 3
brgphy0: using BCM5401 DSP patch
brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ppb1 at pci0 dev 2 function 0: Digital Equipment DC21154 PCI-PCI Bridge (rev. 0x05)
pci4 at ppb1 bus 1
pci4: i/o space, memory space enabled
ppb2 at pci4 dev 0 function 0: Digital Equipment DC21154 PCI-PCI Bridge (rev. 0x05)
pci5 at ppb2 bus 2
pci5: i/o space, memory space enabled
amr0 at pci5 dev 0 function 0: AMI RAID <PERC 3/DC>
amr0: interrupting at ioapic1 pin 4 (irq 11)
amr0: firmware 198U, BIOS 3.35, 128MB RAM
ld0 at amr0 unit 0: RAID 5, optimal
ld0: 34556 MB, 8776 cyl, 128 head, 63 sec, 512 bytes/sect x 70770688 sectors
isp0 at pci4 dev 1 function 0: QLogic Dual Channel Ultra-3 Wide SCSI HBA
isp0: interrupting at ioapic1 pin 5 (irq 5)
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x8) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x39) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x30) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x52) Timeout
fxp1 at pci0 dev 8 function 0: i82559 Ethernet, rev 8
fxp1: interrupting at ioapic1 pin 2 (irq 11)
fxp1: Ethernet address 00:90:27:a4:f6:9b
inphy1 at fxp1 phy 1: i82555 10/100 media interface, rev. 4
inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
vga0 at pci0 dev 14 function 0: ATI Technologies Rage XL (rev. 0x27)
wsdisplay0 at vga0 kbdmux 1: console (80x25, vt100 emulation), using wskbd0
wsmux1: connecting to wsdisplay0
pcib0 at pci0 dev 15 function 0
pcib0: ServerWorks OSB4 southbridge (rev. 0x50)
rccide0 at pci0 dev 15 function 1
rccide0: ServerWorks OSB4 IDE Controller (rev. 0x00)
rccide0: bus-master DMA support present
rccide0: primary channel configured to compatibility mode
rccide0: primary channel interrupting at ioapic0 pin 14 (irq 14)
atabus0 at rccide0 channel 0
rccide0: secondary channel configured to compatibility mode
rccide0: secondary channel interrupting at ioapic0 pin 15 (irq 15)
atabus1 at rccide0 channel 1
ohci0 at pci0 dev 15 function 2: ServerWorks OSB4/CSB5 USB Host Controller (rev. 0x04)
ohci0: interrupting at ioapic0 pin 10 (irq 10)
ohci0: OHCI version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
uhub0 at usb0
uhub0: ServerWorks OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
isa0 at pcib0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: PC speaker
sysbeep0 at pcppi0
isapnp0 at isa0 port 0x279: ISA Plug 'n Play device support
isapnp0: no ISA Plug 'n Play devices found
ioapic0: enabling
ioapic1: enabling
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
Kernelized RAIDframe activated
IPsec: Initialized Security Association Processing.
atapibus0 at atabus0: 2 targets
cd0 at atapibus0 drive 0: <SAMSUNG CD-ROM SN-124, , q009> cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33)
cd0(rccide0:0:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 2 (Ultra/33) (using DMA)
boot device: ld0
root on ld0a dumps on ld0b
root file system type: ffs
cpu1: CPU 0 running
wsdisplay0: screen 1 added (80x25, vt100 emulation)
wsdisplay0: screen 2 added (80x25, vt100 emulation)
wsdisplay0: screen 3 added (80x25, vt100 emulation)
wsdisplay0: screen 4 added (80x25, vt100 emulation)
diablo> 
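
The isp0 noise above is easier to compare between boots when it is collapsed
into counts.  A minimal sketch, assuming the dmesg output has been saved as
plain text; the function name tally_dmesg and the sample lines are
illustrative only, not part of any NetBSD tool:

```python
from collections import Counter


def tally_dmesg(text, device):
    """Count how often each distinct message was logged by one device.

    `text` is saved dmesg output; `device` is an autoconf name such as
    "isp0".  Only lines beginning with "<device>: " are counted.
    """
    prefix = device + ": "
    counts = Counter()
    for line in text.splitlines():
        if line.startswith(prefix):
            counts[line[len(prefix):].strip()] += 1
    return counts


# Illustrative sample copied from the dmesg above, not a full log.
sample = """\
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x8) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
fxp1: Ethernet address 00:90:27:a4:f6:9b
"""

for message, count in tally_dmesg(sample, "isp0").most_common():
    print(f"{count:4d}  {message}")
```

Running the same tally over the dmesg from each kernel would show whether the
mix of mailbox commands that time out changes between boots.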


diablo# pcictl pci0 list
000:00:0: ServerWorks CNB20-HE PCI bridge (host bridge, revision 0x23)
000:00:1: ServerWorks CNB20-HE PCI bridge (host bridge, revision 0x01)
000:00:2: ServerWorks CNB30-LE PCI bridge (host bridge, revision 0x01)
000:00:3: ServerWorks CNB30-LE PCI bridge (host bridge, revision 0x01)
000:02:0: Digital Equipment DC21154 PCI-PCI Bridge (PCI bridge, revision 0x05)
000:08:0: Intel 82557 Fast Ethernet LAN Controller (ethernet network, revision 0x08)
000:14:0: ATI Technologies Rage XL (VGA display, revision 0x27)
000:15:0: ServerWorks OSB4 southbridge (ISA bridge, revision 0x50)
000:15:1: ServerWorks OSB4 IDE (IDE mass storage, interface 0x8a)
000:15:2: ServerWorks OSB4/CSB5 USB Host Controller (USB serial bus, interface 0x10, revision 0x04)
diablo# pcictl pci1 list
004:02:0: Intel i960 RM PCI-PCI (PCI bridge, revision 0x02)
004:04:0: Intel 82557 Fast Ethernet LAN Controller (ethernet network, revision 0x08)
diablo# pcictl pci2 list
diablo# pcictl pci3 list
003:08:0: Broadcom BCM5700 10/100/1000 Ethernet (ethernet network, revision 0x12)
diablo# pcictl pci4 list
001:00:0: Digital Equipment DC21154 PCI-PCI Bridge (PCI bridge, revision 0x05)
001:01:0: Q Logic product 0x1216 (SCSI mass storage, revision 0x06)
diablo# pcictl pci5 list
002:00:0: American Megatrends MegaRAID 3 (RAID mass storage, revision 0x20)
diablo# 
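
Note that the pciN unit numbers do not match the bus numbers pcictl prints:
pci1 is bus 4, pci4 is bus 1, and pci2 (bus 5, behind the i960) lists nothing.
A minimal sketch for turning such a listing into (bus, device, function,
description) tuples, assuming the "BBB:DD:F: description" layout shown above;
parse_pcictl and its prompt parameter are hypothetical names, and lines that
are neither an entry nor a shell prompt are treated as wrapped continuations
of the previous entry:

```python
import re

# One entry per line: three-digit bus, two-digit device, one-digit function.
ENTRY = re.compile(r"^(\d{3}):(\d{2}):(\d): (.*)$")


def parse_pcictl(text, prompt="diablo#"):
    """Parse pasted `pcictl pciN list` output into a list of tuples."""
    entries = []
    for line in text.splitlines():
        match = ENTRY.match(line)
        if match:
            bus, dev, func, desc = match.groups()
            entries.append([int(bus), int(dev), int(func), desc])
        elif entries and line.strip() and not line.startswith(prompt):
            # Wrapped line: append it to the previous description.
            entries[-1][3] += " " + line.strip()
    return [tuple(entry) for entry in entries]


# Illustrative sample taken from the pci1 listing above.
sample = """\
diablo# pcictl pci1 list
004:02:0: Intel i960 RM PCI-PCI (PCI bridge, revision 0x02)
004:04:0: Intel 82557 Fast Ethernet LAN Controller (ethernet network,
revision 0x08)
diablo# pcictl pci2 list
"""

for bus, dev, func, desc in parse_pcictl(sample):
    print(f"bus {bus} dev {dev} function {func}: {desc}")
```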

I'll probably use the system even with its stochastic behavior.  If there
are any ideas, I'm game to try.

peter