Subject: Question about hardware failure under netbsd-1.3_BETA
To: None <port-sparc@NetBSD.ORG, current-users@NetBSD.ORG>
From: Brian Buhrow <buhrow@cats.ucsc.edu>
List: current-users
Date: 01/05/1998 18:04:20
	Hello Fellow NetBSD users.  I have a question which is hardware
specific, but not necessarily port specific.  
	I have a NetBSD/sparc (Sparc 5) machine which has been running stably
for about a month.  Over the course of the last week, it has begun
rebooting ever more frequently.  Note that I have made no changes in the
software running on this box.  What I'm seeing is a kernel data fault with
the pc at the same value over and over again.  It's not always the same
value, but mostly it is.  Here are my questions:

1.  Does the address reported by the kernel represent a virtual address?
How, as I suspect, can I tell if the address is a problem with cache or a
problem with physical memory?

2.  Can I tell, for the Sparc, or for any other architecture for that
matter, if the problem is with memory, on-chip cache, or external cache?

Here's what I'm seeing, in case anyone has ideas.

Thanks in advance.
-Brian


NetBSD/sparc (news) (console)

login: 

NetBSD/sparc (news) (console)

login: data fault: pc=0xf014f4b4 addr=0xef44afe8 sfsr=226<FAV>
panic: kernel fault
syncing disks... 93 92 76 52 37 19 1 1 1 1 1 1 1 1 1 1 1 1 1 1 giving up
Frame pointer is at 0xf85cd8e0
Call traceback:
  pc = 0xf012dbe8  args = (0x0, 0x4000fe0, 0xf016e800, 0xf85cda00, 0xf85cd990, 0x0, 0xf85cd948) fp = 0xf85cd948
  pc = 0xf00383c4  args = (0x100, 0x0, 0x1, 0xf85cda6c, 0xf85cd9f8, 0x0, 0xf85cd9b0) fp = 0xf85cd9b0
  pc = 0xf013acd8  args = (0x100, 0xf015e400, 0x1, 0xf85cda78, 0x0, 0x1, 0xf85cda18) fp = 0xf85cda18
  pc = 0xf00084fc  args = (0xf0be7700, 0x226, 0xef44afe8, 0xef44a000, 0xf014f4b4, 0xf85cdb20, 0xf85cdac0) fp = 0xf85cdac0
  pc = 0xf00d3fdc  args = (0xf12e4810, 0x1e99828, 0xef44afe8, 0x0, 0xffffffff, 0x3140, 0xf85cdb70) fp = 0xf85cdb70
  pc = 0xf00d2c28  args = (0x1e99828, 0x8c, 0x0, 0x41fd, 0x0, 0x8c, 0xf85cdbe0) fp = 0xf85cdbe0
  pc = 0xf00d280c  args = (0xf0b98e00, 0x8c, 0x1aef00, 0x41fd, 0xf00d3e14, 0x3140, 0xf85cdc48) fp = 0xf85cdc48
  pc = 0xf00e6224  args = (0xf85cdd38, 0xf00d2784, 0xf015ec00, 0xffffffff, 0x0, 0x0, 0xf85cdcc0) fp = 0xf85cdcc0
  pc = 0xf005ae10  args = (0xf85cdde0, 0xf85cdde0, 0xf015ec00, 0xf0b9e980, 0x2, 0x0, 0xf85cdd80) fp = 0xf85cdd80
  pc = 0xf013b068  args = (0x0, 0xf85cdf28, 0xf85cdf20, 0xf005ac54, 0x1b4, 0xa, 0xf85cdec0) fp = 0xf85cdec0
  pc = 0xf0008794  args = (0x88, 0xf85cdfb0, 0x0, 0xe6c, 0x68, 0x0, 0xf85cdf50) fp = 0xf85cdf50
  pc = 0xdd0c  args = (0x2dc196, 0x1fd, 0x0, 0xe6c, 0xefffeb28, 0x5378, 0xefffeab8) fp = 0xefffeab8
rebooting

Resetting ... 
screen not found.
Can't open input device.
Keyboard not present.  Using tty for input and output.

SPARCstation 5, No Keyboard
ROM Rev. 2.15 Pilot, 256 MB memory installed, Serial #3531979.
Ethernet address 8:0:20:21:76:cc, Host ID: 8035e4cb.



Initializing Memory |/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\                                                                      Initializing Memory |/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\                                                                      Rebooting with command: 
Boot device: /iommu/sbus/espdma@5,8400000/esp@5,8800000/sd@3,0  File and args: netbsd
>> NetBSD/sparc Secondary Boot, Revision 1.7
>> (pk@flambard, Mon Dec  1 11:25:00 MET 1997)
|/-\|Booting netbsd @ 0x4000
1376256/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/+117056-\|/-\|/-\|/-\+73776+[77844|/-\|/-\|/-\|/-\|/-\+91089]=0x1abd5d
console is ttya
Copyright (c) 1996, 1997 The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 1.3_BETA (GENERIC_SCSI3) #11: Wed Dec  3 00:47:13 MET 1997
    pk@flambard:/usr/src1/sys/arch/sparc/compile/GENERIC_SCSI3
real mem = 268099584
avail mem = 254107648
using 1792 buffers containing 7340032 bytes of memory
bootpath: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0
mainbus0 (root): SUNW,SPARCstation-5
cpu0 at mainbus0: MB86904 @ 85 MHz, on-chip FPU
cpu0: 16K instruction (32 b/l), 8K data (16 b/l): cache enabled
obio0 at mainbus0
clock0 at obio0 addr 0x71200000: mk48t08 (eeprom)
timer0 at obio0 addr 0x71d00000 delay constant 40
zs0 at obio0 addr 0x71100000 pri 12, softpri 6
zstty0 at zs0 channel 0 (console)
zstty1 at zs0 channel 1
zs1 at obio0 addr 0x71000000 pri 12, softpri 6
kbd0 at zs1 channel 0
ms0 at zs1 channel 1
[slavioconfig at obio0] addr 0x71800000 not configured
auxreg0 at obio0 addr 0x71900000
power0 at obio0 addr 0x71910000
fdc0 at obio0 addr 0x71400000 pri 11, softpri 4: chip 82077
iommu0 at mainbus0 addr 0x10000000: version 0x4/0x0, page-size 4096, range 64MB
sbus0 at iommu0: clock = 21.250 MHz
dma0 at sbus0 slot 5 offset 0x8400000: rev 2
esp0 at dma0 slot 0x5 offset 0x8800000 pri 4: ESP200, 40MHz, SCSI ID 7
scsibus0 at esp0: 8 targets
probe(esp0:1:0): max sync rate 10.00Mb/s
sd1 at scsibus0 targ 1 lun 0: <QUANTUM, FIREBALL_TM2110S, 300X> SCSI2 0/direct fixed
sd1: 2014MB, 6810 cyl, 4 head, 151 sec, 512 bytes/sect x 4124736 sectors
probe(esp0:3:0): max sync rate 10.00Mb/s
sd0 at scsibus0 targ 3 lun 0: <SEAGATE, ST5660N  SUN0535, 0522> SCSI2 0/direct fixed
sd0: 520MB, 3002 cyl, 4 head, 88 sec, 512 bytes/sect x 1065664 sectors
SUNW,bpp at sbus0 slot 5 offset 0xc800000 not configured
ledma0 at sbus0 slot 5 offset 0x8400010: rev 2
le0 at ledma0 slot 0x5 offset 0x8c00000 pri 6: address 08:00:20:21:76:cc
le0: 8 receive buffers, 2 transmit buffers
dma1 at sbus0 slot 1 offset 0x100000: rev 1+
esp1 at sbus0 slot 1 offset 0x200000 pri 5: ESP100A, 25MHz, SCSI ID 7
scsibus1 at esp1: 8 targets
probe(esp1:0:0): max sync rate 5.00Mb/s
sd3 at scsibus1 targ 0 lun 0: <SEAGATE, ST15150N, 0020> SCSI2 0/direct fixed
sd3: 4101MB, 3712 cyl, 21 head, 107 sec, 512 bytes/sect x 8399448 sectors
dma2 at sbus0 slot 2 offset 0x100000: rev 1+
esp2 at sbus0 slot 2 offset 0x200000 pri 5: ESP100A, 25MHz, SCSI ID 7
scsibus2 at esp2: 8 targets
probe(esp2:3:0): max sync rate 5.00Mb/s
sd4 at scsibus2 targ 3 lun 0: <SEAGATE, ST19171N, 0024> SCSI2 0/direct fixed
sd4: 8683MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
dma3 at sbus0 slot 3 offset 0x100000: rev 1+
esp3 at sbus0 slot 3 offset 0x200000 pri 5: ESP100A, 25MHz, SCSI ID 7
scsibus3 at esp3: 8 targets
power-management at sbus0 slot 4 offset 0xa000000 not configured
SUNW,CS4231 at sbus0 slot 4 offset 0xc000000 not configured
afx-misc at sbus0 slot 4 offset 0xe000000 not configured
root on sd0a dumps on sd0b
root file system type: ffs
swapctl: adding /dev/sd0b as swap device at priority 0
swapctl: adding /dev/sd1b as swap device at priority 0
Automatic boot in progress: starting file system checks.