Subject: Re: eek, isp crash
To: Sean Doran <smd@ebone.net>
From: Matthew Jacob <mjacob@feral.com>
List: port-alpha
Date: 03/10/1999 16:32:09
Yowza! Let's see..... the F/W returned a TRANSPORT ERROR- the target
disconnected after getting the command block- why that caused the f/w to
blow the command away- I dunno. I sure haven't seen this is in years.

The mboxcmd tiemout#2 is weird- this is a secondary problem as the timeout
code has kicked in, and it's a really weak area of the driver. So- this
failed, and then it gave up and tried to rempletely reinit the HBA, but
*that* blew chunks when it was trying to reset the adapter entirely.

This caused all subsequent commands to hang and die because the the
adapter hadn't come into runstate. Bad news, Batman... The later async
4001 means that the ISP finally woke up and and decided that  the attempt
to set initiator ID was an illegal command.

Looking more closely at the mbox failure, the dump of registers indicates
a DATA DMA fifo overrun condition.=20

What does this all mean and what can be done? Jeez- that's tougher. Can
you reproduce this at will or was it a one time?



On Thu, 11 Mar 1999, Sean Doran wrote:

>=20
> What does oen make of this?
>=20
> isp0: isp_control MBOX_ABORT failure (code 4001)
> isp0: isp_watch failed to abort command
> isp0: Ultra Mode Capable
> isp0: Board Revision 1040B, loaded F/W Revision 7.55
> isp0: Last F/W revision was 5.57
> isp0: transport error
> states->GOT_BUS GOT_TGT SENT_CDB=20
> status->Disconnect=20
> isp0: isp_mboxcmd timeout #2
> isp0: isp_control MBOX_ABORT failure (code 0)
> isp0: isp_watch failed to abort command
> isp0: Ultra Mode Capable
> isp0: Board Revision 1040B, loaded F/W Revision 7.55
> isp0: Last F/W revision was 5.57
> isp0: timeout initiated SCSI bus reset
> isp0: command for target 3 lun 0 was aborted
> isp0: command aborted for target 3 lun 0
> isp0: isp_mboxcmd timeout #2
> isp0: isp_control MBOX_ABORT failure (code 0)
> isp0: isp_watch failed to abort command
> isp0: Ultra Mode Capable
> isp0: Board Revision 1040B, loaded F/W Revision 7.55
> isp0: Last F/W revision was 5.57
> isp0: isp_mboxcmd timeout #2
> isp0: failed to set initiator id
>     biu_conf1=3D34 biu_icr=3D6 biu_isr=3D0 biu_sema=3D0 risc_hccr=3D80
>     cdma_conf=3D0 cdma_sts=3D8 cdma_fifostat=3D0
>     ddma_conf=3D0 ddma_sts=3D2108 ddma_fifostat=3D288
>     sxp_int=3D0 sxp_gross=3D4d00 sxp(scsi_ctrl)=3D6eac
>     mbox regs: 4000 7 37 0 7
> isp0: PCI Status Command/Status=3D2000157
> isp0: isp_restart cannot restart ISP
> isp0: adapter not ready
> isp0: adapter not ready
> isp0: adapter not ready
> isp0: adapter not ready
> isp0: async 4001
> isp0: adapter not ready
> isp0: adapter not ready
> isp0: adapter not ready
> isp0: adapter not ready
>=20
> subsequent boot messages:
>=20
> isp0: adapter not ready
> isp0=FAff.fe.fd.fc.fb.fa.f9.f8.f7.f6.f5.
> ef.df.ee.ed.ec.f4.eb.ea.e9.e8.e7.e6.....e5.
> V6.9-4, built on Oct 12 1998 at 11:22:55
> >>>boot -fl i
> (boot dkb300.3.0.13.0 -file netbsd -flags i)
> block 0 of dkb300.3.0.13.0 is a valid boot block
> reading 15 blocks from dkb300.3.0.13.0
> bootstrap code read in
> base =3D 1f2000, image_start =3D 0, image_bytes =3D 1e00
> initializing HWRPB at 2000
> initializing page table at 1e4000
> initializing machine state
> setting affinity to the primary CPU
> jumping to bootstrap code
>=20
> NetBSD/Alpha 1.3I Primary Boot +
>=20
> .
> Jumping to entry point...
>=20
> NetBSD/Alpha Secondary Boot, Revision 1.9
> (smd@alfa.smd.ebone.net, Fri Mar  5 02:26:37 CET 1999)
>=20
> VMS PAL rev: 0x1000000010538
> OSF PAL rev: 0x100000002012e
> Switch to OSF PAL code succeeded.
>=20
> Boot file: netbsd
> Boot flags: i
> Boot file:=20
>=20
> Loading netbsd...
> 1577192+307800 [85+106560+55809]
>=20
> Entering netbsd at 0xfffffc0000301050...
> Unrecognized boot flag 'i'.
> [ preserving 163360 bytes of netbsd ELF symbol table ]
> Copyright (c) 1996, 1997, 1998, 1999
>     The NetBSD Foundation, Inc.  All rights reserved.
> Copyright (c) 1982, 1986, 1989, 1991, 1993
>     The Regents of the University of California.  All rights reserved.
>=20
> NetBSD 1.3K (SMDALFA) #0: Thu Mar  4 19:21:53 CET 1999
>     smd@alfa.smd.ebone.net:/usr/src/sys/arch/alpha/compile/SMDALFA
> AlphaStation 200 4/233, 233MHz
> 8192 byte page size, 1 processor.
> real mem =3D 83886080 (2048000 reserved for PROM, 81838080 used by NetBSD=
)
> avail mem =3D 69484544
> using 999 buffers containing 8183808 bytes of memory
> mainbus0 (root)
> cpu0 at mainbus0: ID 0 (primary), 21064A ()
> apecs0 at mainbus0: DECchip 21071 Core Logic chipset
> apecs0: DC21071-CA pass 2, 64-bit memory bus
> apecs0: DC21071-DA pass 2
> pci0 at apecs0 bus 0
> pci0: i/o enabled, memory enabled
> ncr0 at pci0 dev 6 function 0: ncr 53c810 fast10 scsi
> ncr0: interrupting at isa irq 11
> ncr0: minsync=3D25, maxsync=3D206, maxoffs=3D8, 16 dwords burst, normal d=
ma fifo
> ncr0: single-ended, open drain IRQ driver
> ncr0: restart (scsi reset).
> scsibus0 at ncr0: 8 targets, 8 luns per target
> sio0 at pci0 dev 7 function 0: Intel 82378ZB System I/O (SIO) (rev. 0x84)
> de0 at pci0 dev 11 function 0
> de0: interrupting at isa irq 5
> de0: DEC 21040 [10Mb/s] pass 2.4
> de0: address 00:00:f8:21:68:c6
> isp0 at pci0 dev 13 function 0
> isp0: Ultra Mode Capable
> isp0: Board Revision 1040B, loaded F/W Revision 7.55
> isp0: Last F/W revision was 5.57
> isp0: interrupting at isa irq 10
> isp0: driver initiated bus reset
> scsibus1 at isp0: 16 targets, 8 luns per target
> sd0 at scsibus1 targ 3 lun 0: <SEAGATE, ST15230W SUN4.2G, 0738> SCSI2 0/d=
irect fixed
> sd0: 4095MB, 3992 cyl, 19 head, 110 sec, 512 bytes/sect x 8386733 sectors
> isa0 at sio0
> com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
> com0: console
> com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
> lpt0 at isa0 port 0x3bc-0x3bf irq 7
> wss0 at isa0 port 0x530-0x537 irq 9 drq 0: CS4231A
> audio0 at wss0: half duplex
> pcppi0 at isa0 port 0x61
> isabeep0 at pcppi0
> fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
> mcclock0 at isa0 port 0x70-0x71: mc146818 or compatible
> root on sd0a dumps on sd0b
> isp0: Target 3 at 10MHz Max Offset 8, 16 bit wide, Tagged Queueing Enable=
d
> WARNING: clock gained 4 days -- CHECK AND RESET THE DATE!
> de0: enabling 10baseT port
> Enter pathname of shell or RETURN for sh:=20
>=20