Subject: RE: Alpha DS10 Hanging on Generic 1.5.3 kernel
To: None <port-alpha@netbsd.org>
From: Juergen Weiss <weiss@uni-mainz.de>
List: port-alpha
Date: 08/05/2002 10:45:11
I had the same problem on a pc164 board with nfs and a tulip.
The crash is reproducible. Trace looked the same. I did
not check if the problem goes away with a non tulip card.

Juergen Weiss

Juergen Weiss	  | Universitaet Mainz, Zentrum fuer Datenverarbeitung,
weiss@uni-mainz.de| 55099 Mainz, Tel: +49(6131)39-26361, FAX:
+49(6131)39-26407


> -----Original Message-----
> From: Johan A. van Zanten [mailto:johan@ewranglers.com]=20
> Sent: Monday, August 05, 2002 6:32 AM
> To: Dave McGuire; port-alpha@netbsd.org
> Subject: Re: Alpha DS10 Hanging on Generic 1.5.3 kernel
>=20
>=20
>=20
> ---In message <15693.35424.608357.979498@osiris.neurotica.com>
>=20
> >  Ahh!  I knew this sounded familiar.  I had to comment out the ochi
> >driver in my kernel config file to get a DS10 booted into 1.5.2 some
> >time ago.
> >
> >      -Dave
>=20
> Thanks Dave, that actually gave me an important clue.  I=20
> plugged a spare
> USB mouse into one of the USB ports, and the DS10 came up multi-user
> running the GENERIC kernel. However, there were still some errors
> associated with the USB hardware, during the boot, (See far below for
> dmesg output from the successful boot.)
>=20
>=20
>  So i immediately NFS mounted 1.5.3 source from brahma=20
> (SPARC-5 running
> 1.5.3), in order to compile a new kernel without USB support.
>=20
>  From ksh, i started an "rm -r=20
> /usr/src/sys/arch/alpha/compile/SARASVATI
> &" going. Horribly inefficient -- i shouldn't be doing am rm=20
> -r via NFS,
> but i let it go as a test to make sure root had write access=20
> to the NFS
> mounted file system.
>=20
>  I was just about to edit /etc/rc.conf (unrelated to compiling a new
> kernel; just keeping myself busy while waiting for the rm -r=20
> to run), when
> Crash-Ola!
>=20
> The machine is still sitting at "db>"
>=20
>  I've been looking seaching www.netbsd.org for something i=20
> can do to help
> identify the problem, but no luck so far.  Is there anything=20
> useful i can
> do beside "trace"?
>=20
> # vi /etc/rc   =20
> fatal kernel trap:
>=20
>     trap entry =3D 0x2 (memory management fault)
>     a0         =3D 0x70
>     a1         =3D 0x1
>     a2         =3D 0x0
>     pc         =3D 0xfffffc000057b9e8
>     ra         =3D 0xfffffc000057b854
>     curproc    =3D 0xfffffc0008636508
>         pid =3D 252, comm =3D nfsio
>=20
> panic: trap
> Stopped in nfsio at     cpu_Debugger+0x4:       ret     zero,(ra)
> db> trace
> cpu_Debugger() at cpu_Debugger+0x4
> panic() at panic+0xfc
> trap() at trap+0x51c
> XentMM() at XentMM+0x20
> --- memory management fault (from ipl 4) ---
> tulip_tx_intr() at tulip_tx_intr+0x208
> tulip_txput() at tulip_txput+0x130
> tulip_ifstart_one() at tulip_ifstart_one+0x58
> ether_output() at ether_output+0x964
> ip_output() at ip_output+0x13f8
> udp_output() at udp_output+0x43c
> udp_usrreq() at udp_usrreq+0x32c
> sosend() at sosend+0x904
> nfs_send() at nfs_send+0xe4
> nfs_request() at nfs_request+0x470
> nfs_writerpc() at nfs_writerpc+0xb14
> nfs_doio() at nfs_doio+0x67c
> nfssvc_iod() at nfssvc_iod+0x1e4
> start_nfsio() at start_nfsio+0x1c
> esigcode() at esigcode
> --- root of call graph ---
>=20
>=20
>  --johan
>=20
> dmesg output:
>=20
> [ preserving 453608 bytes of netbsd ELF symbol table ]
> consinit: not using prom console
> Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001
>     The NetBSD Foundation, Inc.  All rights reserved.
> Copyright (c) 1982, 1986, 1989, 1991, 1993
>     The Regents of the University of California.  All rights reserved.
>=20
> NetBSD 1.5.3 (GENERIC) #59: Mon Jul  1 21:23:57 MEST 2002
>    =20
> he@albatross.urc.uninett.no:/usr/src/sys/arch/alpha/compile/GENERIC
> COMPAQ AlphaServer DS10 466 MHz
> 8192 byte page size, 1 processor.
> total memory =3D 1024 MB
> (2800 KB reserved for PROM, 1021 MB used by NetBSD)
> avail memory =3D 943 MB
> using 6548 buffers containing 52384 KB of memory
> mainbus0 (root)
> cpu0 at mainbus0: ID 0 (primary), 21264-4 (pass 3)
> cpu0: Architecture extensions: 303<PAT,MVI,FIX,BWX>
> tsc0 at mainbus0: 21272 Core Logic Chipset, Cchip rev 0
> tsc0: 2 Dchips, 1 memory bus of 16 bytes
> tsc0: arrays present: 512MB, 512MB, 0MB, 0MB, Dchip 0 rev 1
> tsp0 at tsc0
> pci0 at tsp0 bus 0
> pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
> ohci0 at pci0 dev 1 function 0: Acer Labs M5237 USB Host=20
> Controller (rev. 0x03)
> ohci0: interrupting at isa irq 10
> ohci0: OHCI version 1.0, legacy support
> usb0 at ohci0: USB revision 1.0
> uhub0 at usb0
> uhub0: Acer Labs OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
> uhub0: 2 ports with 2 removable, self powered
> sio0 at pci0 dev 7 function 0: Acer Labs M1543 PCI-ISA Bridge=20
> (rev. 0xc3)
> de0 at pci0 dev 9 function 0
> de0: interrupting at dec 6600 irq 29
> de0: DEC 21143 [10-100Mb/s] pass 4.1
> de0: address 08:00:2b:86:77:93
> de1 at pci0 dev 11 function 0
> de1: interrupting at dec 6600 irq 30
> de1: DEC 21143 [10-100Mb/s] pass 4.1
> de1: address 08:00:2b:86:77:a8
> de1: enabling 10baseT port
> pciide0 at pci0 dev 13 function 0: Acer Labs M5229 UDMA IDE=20
> Controller (rev. 0xc
> 1)
> pciide0: bus-master DMA support present
> pciide0: primary channel wired to compatibility mode
> pciide0: disabling primary channel (no drives)
> pciide0: secondary channel wired to compatibility mode
> atapibus0 at pciide0 channel 1
> cd0 at atapibus0 drive 0: <COMPAQ  CDR-8435, , 0013> type 5=20
> cdrom removable
> cd0: 32-bit data port
> cd0: drive supports PIO mode 4, DMA mode 2
> pciide0: secondary channel interrupting at isa irq 15
> cd0(pciide0:1:0): using PIO mode 4, DMA mode 2 (using DMA=20
> data transfers)
> siop0 at pci0 dev 15 function 0: Symbios Logic 53c895=20
> (ultra2-wide scsi)
> siop0: using on-board RAM
> siop0: interrupting at dec 6600 irq 39
> scsibus0 at siop0: 16 targets, 8 luns per target
> isa0 at sio0
> com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
> com0: console
> com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
> pckbc0 at isa0 port 0x60-0x64
> lpt0 at isa0 port 0x3bc-0x3bf irq 7
> pcppi0 at isa0 port 0x61
> spkr0 at pcppi0
> isabeep0 at pcppi0
> fdc0 at isa0 port 0x3f0-0x3f7 irq 6 drq 2
> fd0 at fdc0 drive 0: 1.44MB, 80 cyl, 2 head, 18 sec
> mcclock0 at isa0 port 0x70-0x71: mc146818 or compatible
> siop0: switching to single-ended mode
> scsibus0: waiting 2 seconds for devices to settle...
> siop0: target 0 using tagged queuing
> sd0 at scsibus0 target 0 lun 0: <IBM, DDYS-T09170N, S93E>=20
> SCSI3 0/direct fixed
> siop0: target 0 using 16bit transfers
> siop0: target 0 now synchronous at 20.0Mhz, offset 31
> sd0: 8748 MB, 15110 cyl, 3 head, 395 sec, 512 bytes/sect x=20
> 17916240 sectors
> siop0: target 1 using tagged queuing
> sd1 at scsibus0 target 1 lun 0: <IBM, DDYS-T09170N, S93E>=20
> SCSI3 0/direct fixed
> siop0: target 1 using 16bit transfers
> siop0: target 1 now synchronous at 20.0Mhz, offset 31
> sd1: 8748 MB, 15110 cyl, 3 head, 395 sec, 512 bytes/sect x=20
> 17916240 sectors
> de0: enabling 10baseT port
> stray isa irq 10
> stray isa irq 10
> stray isa irq 10
> stray isa irq 10
> stray isa irq 10; stopped logging
> uhub0: device problem, disabling port 1
> root on sd0a dumps on sd0b
> root file system type: ffs
> de0: enabling 10baseT port
>=20