Subject: Compaq Alpha 1.6.1_STABLE crash
To: None <netbsd-help@netbsd.org>
From: Johan A. van Zanten <johan@giantfoo.org>
List: netbsd-help
Date: 10/10/2003 08:56:37
 Greetings.  I had an Alpha running NetBSD 1.6.1-STABLE crash last night.
It's a mail, httpd and shell server.  Pretty light load.  I've seen it
crash to db once before, but i'm pretty sure that was because it's boot
disk had some unrecoverable media errors, and i replaced that drive.

 Here's the output from "trace" from db, and the boot-time messages.

Anyone have any ideas what the cause of the crash may have been?

 -johan


db> tr
cpu_Debugger() at cpu_Debugger+0x4
panic() at panic+0x168
trap() at trap+0x5f4
XentIF() at XentIF+0x20
--- instruction fault (from ipl 0) ---
sched_qs() at sched_qs+0x54
hook_proc_run() at hook_proc_run+0x70
prologue botch: displacement 32
frame size botch: adjust register offsets?
hook_proc_run() at hook_proc_run+0x40
hook_proc_run() at hook_proc_run+0x40
doexithooks() at doexithooks+0x1c
exit1() at exit1+0x168
sys_exit() at sys_exit+0x24
syscall_plain() at syscall_plain+0x154
XentSys() at XentSys+0x58
--- syscall (1) ---
--- user mode ---
db> sync
syncing disks... tlp0: receive ring overrun
8 8 7 7 7 6 6 5 5 5 4 4 4 4 3 3 2 2 1 done

dump to dev 8,1 not possible
rebooting...


halted CPU 0

halt code = 5
HALT instruction executed
PC = fffffc0000300118

CPU 0 booting

Resetting I/O buses...
(boot dka100.1.0.15.0 -flags a)
block 0 of dka100.1.0.15.0 is a valid boot block
reading 15 blocks from dka100.1.0.15.0
bootstrap code read in
base = 200000, image_start = 0, image_bytes = 1e00(7680)
initializing HWRPB at 2000
initializing page table at 3ff3a000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code

NetBSD/alpha 1.6.1_STABLE FFS Primary Bootstrap
Jumping to entry point...

NetBSD/alpha 1.6.1 Secondary Bootstrap, Revision 1.13
(toor@sarasvati, Thu May  1 18:11:30 CDT 2003)

VMS PAL rev: 0x1005200010160
OSF PAL rev: 0x100480002015a
Switch to OSF PAL code succeeded.

Boot flags: a
3248680+301408 [172344+95974]=0x3a4768

Entering netbsd at 0xfffffc0000301290...
[ using 269280 bytes of netbsd ELF symbol table ]
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 1.6.1 (BAGELSPIT) #1: Wed Apr 16 11:19:53 CDT 2003
    johan@sarasvati:/local/src/NetBSD/NetBSD-1.6/usr/src/sys/arch/alpha/compileT
COMPAQ AlphaServer DS10 466 MHz, s/n 4008DQMZ11
8192 byte page size, 1 processor.
total memory = 1024 MB
(2848 KB reserved for PROM, 1021 MB used by NetBSD)
avail memory = 946 MB
using 6548 buffers containing 52384 KB of memory
mainbus0 (root)
cpu0 at mainbus0: ID 0 (primary), 21264-4
cpu0: Architecture extensions: 303<PAT,MVI,FIX,BWX>
tsc0 at mainbus0: 21272 Core Logic Chipset, Cchip rev 0
tsc0: 2 Dchips, 1 memory bus of 16 bytes
tsc0: arrays present: 512MB (split), 512MB, 0MB, 0MB, Dchip 0 rev 1
tsp0 at tsc0
pci0 at tsp0 bus 0
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
sio0 at pci0 dev 7 function 0: Acer Labs M1543 PCI-ISA Bridge (rev. 0xc3)
tlp0 at pci0 dev 9 function 0: DECchip 21143 Ethernet, pass 4.1
tlp0: interrupting at dec 6600 irq 29
tlp0: DEC , Ethernet address 08:00:2b:86:5e:c6
tlp0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
tlp1 at pci0 dev 11 function 0: DECchip 21143 Ethernet, pass 4.1
tlp1: interrupting at dec 6600 irq 30
tlp1: DEC , Ethernet address 08:00:2b:86:5e:c7
tlp1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pciide0 at pci0 dev 13 function 0: Acer Labs M5229 UDMA IDE Controller (rev. 0x)
pciide0: bus-master DMA support present
pciide0: primary channel wired to compatibility mode
atapibus0 at pciide0 channel 0: 2 targets
cd0 at atapibus0 drive 0: <COMPAQ  CDR-8435, , 0013> type 5 cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2
pciide0: primary channel interrupting at isa irq 14
cd0(pciide0:0:0): using PIO mode 4, DMA mode 2 (using DMA data transfers)
pciide0: secondary channel wired to compatibility mode
pciide0: disabling secondary channel (no drives)
isp0 at pci0 dev 15 function 0: QLogic 1020 Ultra Wide SCSI HBA
isp0: interrupting at dec 6600 irq 39
scsibus0 at isp0: 16 targets, 8 luns per target
isa0 at sio0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com0: console
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
lpt0 at isa0 port 0x3bc-0x3bf irq 7
pcppi0 at isa0 port 0x61
spkr0 at pcppi0
isabeep0 at pcppi0
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
mcclock0 at isa0 port 0x70-0x71: mc146818 or compatible
scsibus0: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 1 lun 0: <IBM, DDYS-T09170N, S93E> SCSI3 0/direct fixed
sd0: 8748 MB, 15110 cyl, 3 head, 395 sec, 512 bytes/sect x 17916240 sectors
sd0: sync (50.0ns offset 8), 16-bit (40.000MB/s) transfers, tagged queueing
sd1 at scsibus0 target 3 lun 0: <IBM, DDYS-T09170N, S93E> SCSI3 0/direct fixed
sd1: 8748 MB, 15110 cyl, 3 head, 395 sec, 512 bytes/sect x 17916240 sectors
sd1: sync (50.0ns offset 8), 16-bit (40.000MB/s) transfers, tagged queueing
root on sd0a dumps on sd0b
root file system type: ffs
Fri Oct 10 03:08:28 GMT 2003
swapctl: adding /dev/sd0b as swap device at priority 0