Subject: Re: Alpha DS10 Hanging on Generic 1.5.3 kernel
To: Dave McGuire <mcguire@neurotica.com>
From: Johan A. van Zanten <johan@ewranglers.com>
List: port-alpha
Date: 08/05/2002 00:32:00
---In message <15693.35424.608357.979498@osiris.neurotica.com>

>  Ahh!  I knew this sounded familiar.  I had to comment out the ochi
>driver in my kernel config file to get a DS10 booted into 1.5.2 some
>time ago.
>
>      -Dave

Thanks Dave, that actually gave me an important clue.  I plugged a spare
USB mouse into one of the USB ports, and the DS10 came up multi-user
running the GENERIC kernel. However, there were still some errors
associated with the USB hardware, during the boot, (See far below for
dmesg output from the successful boot.)


 So i immediately NFS mounted 1.5.3 source from brahma (SPARC-5 running
1.5.3), in order to compile a new kernel without USB support.

 From ksh, i started an "rm -r /usr/src/sys/arch/alpha/compile/SARASVATI
&" going. Horribly inefficient -- i shouldn't be doing am rm -r via NFS,
but i let it go as a test to make sure root had write access to the NFS
mounted file system.

 I was just about to edit /etc/rc.conf (unrelated to compiling a new
kernel; just keeping myself busy while waiting for the rm -r to run), when
Crash-Ola!

The machine is still sitting at "db>"

 I've been looking seaching www.netbsd.org for something i can do to help
identify the problem, but no luck so far.  Is there anything useful i can
do beside "trace"?

# vi /etc/rc    
fatal kernel trap:

    trap entry = 0x2 (memory management fault)
    a0         = 0x70
    a1         = 0x1
    a2         = 0x0
    pc         = 0xfffffc000057b9e8
    ra         = 0xfffffc000057b854
    curproc    = 0xfffffc0008636508
        pid = 252, comm = nfsio

panic: trap
Stopped in nfsio at     cpu_Debugger+0x4:       ret     zero,(ra)
db> trace
cpu_Debugger() at cpu_Debugger+0x4
panic() at panic+0xfc
trap() at trap+0x51c
XentMM() at XentMM+0x20
--- memory management fault (from ipl 4) ---
tulip_tx_intr() at tulip_tx_intr+0x208
tulip_txput() at tulip_txput+0x130
tulip_ifstart_one() at tulip_ifstart_one+0x58
ether_output() at ether_output+0x964
ip_output() at ip_output+0x13f8
udp_output() at udp_output+0x43c
udp_usrreq() at udp_usrreq+0x32c
sosend() at sosend+0x904
nfs_send() at nfs_send+0xe4
nfs_request() at nfs_request+0x470
nfs_writerpc() at nfs_writerpc+0xb14
nfs_doio() at nfs_doio+0x67c
nfssvc_iod() at nfssvc_iod+0x1e4
start_nfsio() at start_nfsio+0x1c
esigcode() at esigcode
--- root of call graph ---


 --johan

dmesg output:

[ preserving 453608 bytes of netbsd ELF symbol table ]
consinit: not using prom console
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 1.5.3 (GENERIC) #59: Mon Jul  1 21:23:57 MEST 2002
    he@albatross.urc.uninett.no:/usr/src/sys/arch/alpha/compile/GENERIC
COMPAQ AlphaServer DS10 466 MHz
8192 byte page size, 1 processor.
total memory = 1024 MB
(2800 KB reserved for PROM, 1021 MB used by NetBSD)
avail memory = 943 MB
using 6548 buffers containing 52384 KB of memory
mainbus0 (root)
cpu0 at mainbus0: ID 0 (primary), 21264-4 (pass 3)
cpu0: Architecture extensions: 303<PAT,MVI,FIX,BWX>
tsc0 at mainbus0: 21272 Core Logic Chipset, Cchip rev 0
tsc0: 2 Dchips, 1 memory bus of 16 bytes
tsc0: arrays present: 512MB, 512MB, 0MB, 0MB, Dchip 0 rev 1
tsp0 at tsc0
pci0 at tsp0 bus 0
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
ohci0 at pci0 dev 1 function 0: Acer Labs M5237 USB Host Controller (rev. 0x03)
ohci0: interrupting at isa irq 10
ohci0: OHCI version 1.0, legacy support
usb0 at ohci0: USB revision 1.0
uhub0 at usb0
uhub0: Acer Labs OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
sio0 at pci0 dev 7 function 0: Acer Labs M1543 PCI-ISA Bridge (rev. 0xc3)
de0 at pci0 dev 9 function 0
de0: interrupting at dec 6600 irq 29
de0: DEC 21143 [10-100Mb/s] pass 4.1
de0: address 08:00:2b:86:77:93
de1 at pci0 dev 11 function 0
de1: interrupting at dec 6600 irq 30
de1: DEC 21143 [10-100Mb/s] pass 4.1
de1: address 08:00:2b:86:77:a8
de1: enabling 10baseT port
pciide0 at pci0 dev 13 function 0: Acer Labs M5229 UDMA IDE Controller (rev. 0xc
1)
pciide0: bus-master DMA support present
pciide0: primary channel wired to compatibility mode
pciide0: disabling primary channel (no drives)
pciide0: secondary channel wired to compatibility mode
atapibus0 at pciide0 channel 1
cd0 at atapibus0 drive 0: <COMPAQ  CDR-8435, , 0013> type 5 cdrom removable
cd0: 32-bit data port
cd0: drive supports PIO mode 4, DMA mode 2
pciide0: secondary channel interrupting at isa irq 15
cd0(pciide0:1:0): using PIO mode 4, DMA mode 2 (using DMA data transfers)
siop0 at pci0 dev 15 function 0: Symbios Logic 53c895 (ultra2-wide scsi)
siop0: using on-board RAM
siop0: interrupting at dec 6600 irq 39
scsibus0 at siop0: 16 targets, 8 luns per target
isa0 at sio0
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com0: console
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
pckbc0 at isa0 port 0x60-0x64
lpt0 at isa0 port 0x3bc-0x3bf irq 7
pcppi0 at isa0 port 0x61
spkr0 at pcppi0
isabeep0 at pcppi0
fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
mcclock0 at isa0 port 0x70-0x71: mc146818 or compatible
siop0: switching to single-ended mode
scsibus0: waiting 2 seconds for devices to settle...
siop0: target 0 using tagged queuing
sd0 at scsibus0 target 0 lun 0: <IBM, DDYS-T09170N, S93E> SCSI3 0/direct fixed
siop0: target 0 using 16bit transfers
siop0: target 0 now synchronous at 20.0Mhz, offset 31
sd0: 8748 MB, 15110 cyl, 3 head, 395 sec, 512 bytes/sect x 17916240 sectors
siop0: target 1 using tagged queuing
sd1 at scsibus0 target 1 lun 0: <IBM, DDYS-T09170N, S93E> SCSI3 0/direct fixed
siop0: target 1 using 16bit transfers
siop0: target 1 now synchronous at 20.0Mhz, offset 31
sd1: 8748 MB, 15110 cyl, 3 head, 395 sec, 512 bytes/sect x 17916240 sectors
de0: enabling 10baseT port
stray isa irq 10
stray isa irq 10
stray isa irq 10
stray isa irq 10
stray isa irq 10; stopped logging
uhub0: device problem, disabling port 1
root on sd0a dumps on sd0b
root file system type: ffs
de0: enabling 10baseT port