Subject: Problems with NetBSD 1.3 Solaris2 original filesystem
To: None <port-sparc@NetBSD.ORG, current-users@NetBSD.ORG>
From: Brian Buhrow <buhrow@cats.ucsc.edu>
List: current-users
Date: 01/06/1998 14:38:03
	Hello NetBSD users.  I wrote yesterday concerning a troublesome Sparc
5 which kept dying due to a kernel data fault.  Several people wrote back
saying that they thought the problem was a memory error.  Thanks for the
pointer, I thought it was a cache error.
	Now, after having changed the CPU and the RAM, we're still seeing a
similar problem whereby the machine panics due to a kernel data fault.
However, we're also seeing a new type of panic on a 8GB filesystem which
was originally created under Solaris 2.5.  Could someone explain what the
following message means, and if there is a way to fix it short of newfsing
the entire disk?  Also, could this message, in some way, be causing the
kernel data faults as well?  What we see is a panic due to a kernel fault,
20 minutes of activity, a panic due to this ifree message, another 20
minutes of activity, another panic due to the ifree message, and so forth
with reboots every 20 minutes or so.  Whether or not the panic is caused by
ifree or a data fault seems to be more or less random.
	Any help anyone could offer would be much appreciated.  I'm really
stumped by this one and the situation is getting desparate.

	Below is a sample panic message of either type, followed by a df on
the affected filesystem, followed by the boot output.  (I upgraded the
kernel to 1.3 today, but that hasn't changed anything.)


news# data fault: pc=0xf014f5a4 addr=0xef8eafea sfsr=226<FAV>
panic: kernel fault
syncing disks... 277 277 269 258 244 225 202 173 140 97 54 7 1 1 1 1 1 1 1 1 giving up
Frame pointer is at 0xf85cd8e0
Call traceback:
  pc = 0xf012dcd8  args = (0x0, 0x4001fe0, 0xf016e800, 0xf85cda00, 0xf85cd990, 0x0, 0xf85cd948) fp = 0xf85cd948
  pc = 0xf0038454  args = (0x100, 0x0, 0x1, 0xf85cda6c, 0xf85cd9f8, 0x0, 0xf85cd9b0) fp = 0xf85cd9b0
  pc = 0xf013adc8  args = (0x100, 0xf015e400, 0x1, 0xf85cda78, 0x0, 0x1, 0xf85cda18) fp = 0xf85cda18
  pc = 0xf00084fc  args = (0xf0be3600, 0x226, 0xef8eafea, 0xef8ea000, 0xf014f5a4, 0xf85cdb20, 0xf85cdac0) fp = 0xf85cdac0
  pc = 0xf00d3e64  args = (0xf1784811, 0x1e99827, 0xef8eafea, 0x0, 0xffffffff, 0x3140, 0xf85cdb70) fp = 0xf85cdb70
  pc = 0xf00d2ab0  args = (0x1e99827, 0x8c, 0x0, 0x41fd, 0x0, 0x8c, 0xf85cdbe0) fp = 0xf85cdbe0
  pc = 0xf00d2694  args = (0xf0ba3c00, 0x8c, 0x1aef00, 0x41fd, 0xf00d3c9c, 0x3140, 0xf85cdc48) fp = 0xf85cdc48
  pc = 0xf00e60ac  args = (0xf85cdd38, 0xf00d260c, 0xf015ec00, 0xffffffff, 0x0, 0x0, 0xf85cdcc0) fp = 0xf85cdcc0
  pc = 0xf005aea0  args = (0xf85cdde0, 0xf85cdde0, 0xf015ec00, 0xf0bc9880, 0x2, 0x0, 0xf85cdd80) fp = 0xf85cdd80
  pc = 0xf013b158  args = (0x0, 0xf85cdf28, 0xf85cdf20, 0xf005ace4, 0x1b4, 0xa, 0xf85cdec0) fp = 0xf85cdec0
  pc = 0xf0008794  args = (0x88, 0xf85cdfb0, 0x0, 0xcd6, 0x68, 0x0, 0xf85cdf50) fp = 0xf85cdf50
  pc = 0xdd0c  args = (0x2fca3c, 0x1fd, 0x0, 0xcd6, 0xefffeb28, 0x5378, 0xefffeab8) fp = 0xefffeab8
rebooting

Resetting ... 
[skip booting output, see below.]

[df -k here]
Script started on Tue Jan 06 14:34:20 1998
NetBSD 1.3 (GENERIC_SCSI3) #0: Thu Jan  1 19:03:39 MET 1998

Welcome to NetBSD!

news# df -k /var/spool/news
Filesystem  1K-blocks     Used    Avail Capacity  Mounted on
/dev/sd4c     8321498  6355510  1133838    85%    /var/spool/news
news# exit
news# logout
script done on Tue Jan 06 14:34:48 1998
login: sd4(esp2:3:0): illegal request, data = 00 00 00 00 21 00 03 cf 00 02
panic: ifree: range: dev = 0x722, ino = -254910704, fs = /var/spool/news

syncing disks... 193 193 186 173 159 141 118 90 56 20 done
Frame pointer is at 0xf85cdaa0
Call traceback:
  pc = 0xf012dcd8  args = (0x0, 0x4000fe3, 0xf016e800, 0xf85cdbc0, 0xf014f5a4, 0xf85cdb20, 0xf85cdb08) fp = 0xf85cdb08
  pc = 0xf0038454  args = (0x100, 0x0, 0x1, 0x0, 0x0, 0x3140, 0xf85cdb70) fp = 0xf85cdb70
  pc = 0xf00d49f4  args = (0x100, 0x722, 0xf0ce5f10, 0xf0b7f8d4, 0x0, 0x40, 0xf85cdbd8) fp = 0xf85cdbd8
  pc = 0xf00d26f0  args = (0xf85cdca8, 0xf00d49b4, 0xf85cdd54, 0xf00d82e0, 0xf00d3c9c, 0x3140, 0xf85cdc48) fp = 0xf85cdc48
  pc = 0xf00e60ac  args = (0xf85cdd38, 0xf00d260c, 0xf015ec00, 0xffffffff, 0x0, 0x0, 0xf85cdcc0) fp = 0xf85cdcc0
  pc = 0xf005aea0  args = (0xf85cdde0, 0xf85cdde0, 0xf015ec00, 0xf0bd8c00, 0x2, 0x0, 0xf85cdd80) fp = 0xf85cdd80
  pc = 0xf013b158  args = (0x0, 0xf85cdf28, 0xf85cdf20, 0xf005ace4, 0x1b4, 0xa, 0xf85cdec0) fp = 0xf85cdec0
  pc = 0xf0008794  args = (0x88, 0xf85cdfb0, 0x0, 0xc6d, 0x68, 0x0, 0xf85cdf50) fp = 0xf85cdf50
  pc = 0xdd0c  args = (0x2fca3c, 0x1fd, 0x0, 0xc6d, 0xefffeb28, 0x5378, 0xefffeab8) fp = 0xefffeab8
rebooting

Resetting ... 
screen not found.
Can't open input device.
Keyboard not present.  Using tty for input and output.

SPARCstation 5, No Keyboard
ROM Rev. 2.15, 256 MB memory installed, Serial #7788135.
Ethernet address 8:0:20:76:d6:67, Host ID: 8076d667.



Initializing Memory |/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\                                                                      Rebooting with command: 
Boot device: /iommu/sbus/espdma@5,8400000/esp@5,8800000/sd@3,0  File and args: 
>> NetBSD/sparc Secondary Boot, Revision 1.7
>> (pk@flambard, Mon Dec  1 11:25:00 MET 1997)
Booting netbsd @ 0x4000
1376256/+117040-\|/-\+73776+[77844|/-\+91089]=0x1abd4d
console is ttya
Copyright (c) 1996, 1997 The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 1.3 (GENERIC_SCSI3) #0: Thu Jan  1 19:03:39 MET 1998
    pk@flambard:/usr/src1/sys/arch/sparc/compile/GENERIC_SCSI3
real mem = 268107776
avail mem = 254115840
using 1792 buffers containing 7340032 bytes of memory
bootpath: /iommu@0,10000000/sbus@0,10001000/espdma@5,8400000/esp@5,8800000/sd@3,0
mainbus0 (root): SUNW,SPARCstation-5
cpu0 at mainbus0: MB86904 @ 85 MHz, on-chip FPU
cpu0: 16K instruction (32 b/l), 8K data (16 b/l): cache enabled
obio0 at mainbus0
clock0 at obio0 addr 0x71200000: mk48t08 (eeprom)
timer0 at obio0 addr 0x71d00000 delay constant 40
zs0 at obio0 addr 0x71100000 pri 12, softpri 6
zstty0 at zs0 channel 0 (console)ªzstty1 at zs0 channel 1
zs1 at obio0 addr 0x71000000 pri 12, softpri 6
kbd0 at zs1 channel 0
ms0 at zs1 channel 1
[slavioconfig at obio0] addr 0x71800000 not configured
auxreg0 at obio0 addr 0x71900000
power0 at obio0 addr 0x71910000
fdc0 at obio0 addr 0x71400000 pri 11, softpri 4: chip 82077
iommu0 at mainbus0 addr 0x10000000: version 0x4/0x0, page-size 4096, range 64MB
sbus0 at iommu0: clock = 21.250 MHz
dma0 at sbus0 slot 5 offset 0x8400000: rev 2
esp0 at dma0 slot 0x5 offset 0x8800000 pri 4: ESP200, 40MHz, SCSI ID 7
scsibus0 at esp0: 8 targets
probe(esp0:1:0): max sync rate 10.00Mb/s
sd1 at scsibus0 targ 1 lun 0: <QUANTUM, FIREBALL_TM2110S, 300X> SCSI2 0/direct fixed
sd1: 2014MB, 6810 cyl, 4 head, 151 sec, 512 bytes/sect x 4124736 sectors
probe(esp0:3:0): max sync rate 10.00Mb/s
sd0 at scsibus0 targ 3 lun 0: <SEAGATE, ST5660N  SUN0535, 0522> SCSI2 0/direct fixed
sd0: 520MB, 3002 cyl, 4 head, 88 sec, 512 bytes/sect x 1065664 sectors
SUNW,bpp at sbus0 slot 5 offset 0xc800000 not configured
ledma0 at sbus0 slot 5 offset 0x8400010: rev 2
le0 at ledma0 slot 0x5 offset 0x8c00000 pri 6: address 08:00:20:76:d6:67
le0: 8 receive buffers, 2 transmit buffers
SUNW,CS4231 at sbus0 slot 4 offset 0xc000000 not configured
power-management at sbus0 slot 4 offset 0xa000000 not configured
dma1 at sbus0 slot 1 offset 0x100000: rev 1+
esp1 at sbus0 slot 1 offset 0x200000 pri 5: ESP100A, 25MHz, SCSI ID 7
scsibus1 at esp1: 8 targets
probe(esp1:0:0): max sync rate 5.00Mb/s
sd3 at scsibus1 targ 0 lun 0: <SEAGATE, ST15150N, 0020> SCSI2 0/direct fixed
sd3: 4101MB, 3712 cyl, 21 head, 107 sec, 512 bytes/sect x 8399448 sectors
dma2 at sbus0 slot 2 offset 0x100000: rev 1+
esp2 at sbus0 slot 2 offset 0x200000 pri 5: ESP100A, 25MHz, SCSI ID 7
scsibus2 at esp2: 8 targets
probe(esp2:3:0): max sync rate 5.00Mb/s
sd4 at scsibus2 targ 3 lun 0: <SEAGATE, ST19171N, 0024> SCSI2 0/direct fixed
sd4: 8683MB, 5268 cyl, 20 head, 168 sec, 512 bytes/sect x 17783112 sectors
root on sd0a dumps on sd0b
root file system type: ffs
swapctl: adding /dev/sd0b as swap device at priority 0
swapctl: adding /dev/sd1b as swap device at priority 0
Automatic boot in progress: starting file system checks.
[boot messages deleted]
login: sd4(esp2:3:0): illegal request, data = 00 00 00 00 21 00 03 cf 00 02
panic: ifree: range: dev = 0x722, ino = -254910704, fs = /var/spool/news

syncing disks... 300 299 292 280 265 247 224 194 161 121 74 28 done