Subject: Re: Dell PowerEdge 1550 hangs at boot.
To: Manuel Bouyer <bouyer@antioche.eu.org>
From: Peter Eisch <peter@boku.net>
List: port-i386
Date: 03/16/2006 10:36:43
When the device is first being discovered, on this recent reboot, it debated
for a long time whether the raid controller was there or not:

amr0 at pci5 dev 0 function 0: AMI RAID <PERC 3/DC>
amr0: interrupting at irq 11
amr0: firmware 198U, BIOS 3.35, 128MB RAM
ld0 at amr0 unit 0: RAID 5, optimal
ld0: 34556 MB, 8776 cyl, 128 head, 63 sec, 512 bytes/sect x 70770688 sectors
isp0 at pci4 dev 1 function 0: QLogic Dual Channel Ultra-3 Wide SCSI HBA
isp0: interrupting at irq 5
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x8) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x38) Timeout
isp0: Polled Mailbox Command (0x30) Timeout
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: interrupt (ISR=4 SEMA=0) when not ready
isp0: Polled Mailbox Command (0x53) Timeout
fxp1 at pci0 dev 8 function 0: i82559 Ethernet, rev 8
fxp1: interrupting at irq 11
fxp1: Ethernet address 00:90:27:a4:f6:9b

Eventually it booted.  When it got to the stall looking for root, I did the
break.  From the tr (after the handling of the break, sorry, no serial
console):

cpu_switch(c083fa40,0,c0910b28,c04185bc,0) at netbsd:cpu_switch+0x9f
ltsleep(c29a8000,11,c0732561,0c29a8008) at netbsd:ltsleep+0x33f
biowait(c29a8000,0,4,c29a0fd0,0) at netbsd:biowait+0x43
read_sector(c0910c30,0,c29a8fd8,c29a8fd8,c0910bc0) at
netbsd:read_secotr+0x48
scan_mbr ... At netbsd:scan_mbr+0x2e
readdisklabel ... At netbsd:readdisklabel+0xc8
ldgetdisklabel ... At netbsd:ldgetdisklabel+0x49
ldopen ... At netbsd:ldopen+0x14b
spec_open 
VOP_OPEN
rf_find_raid_components
rf_autoconfig
config_finalize
main


On 3/15/06 2:27 PM, "Manuel Bouyer" <bouyer@antioche.eu.org> wrote:

> On Tue, Mar 14, 2006 at 08:47:00PM +0100, Lars-Johan Liman wrote:
>> [...]
>> The kernel loads fine, and probes the devices, finding ld0 without a
>> hickup, and all is well ... UNTIL it's about to start init. The last
>> trace is
>> 
>> cd0: ... ... (all fine)
>>      <HANG>
>>      
>> ... and that's where it's supposed to say
> 
> Can you enter ddb (ctrl-alt-esc) at this point ? If so, what's the
> output of the 'tr' command ?