Port-sgimips archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Cache error on R5000 Challenge S



Hi,

I'm using a R5000/180 Challenge S as a mail/name/web server on my home
network. It's running 4.0 and after being up for 61 days it crashed with
a cache error. I've included the dmesg, stack backtrace and process list
below. I tried rebooting from the db> prompt but it threw another cache
error while running the rc scripts, details also included below. I then
did a hardware reset and it's now been running for 3 days with no
further errors. Could this be a hardware problem or is it more likely to
be software? If software, is there any additional information I should
collect the next time it happens?

Regards,
George

=== console output showing initial cache error then a second error
=== on reboot

db> dmesg
tlp0: receive error: dribbling bit
tlp0: receive error: CRC error
# lots of tlp0 errors deleted to save space
tlp0: receive error: CRC error
panic: cache error @ EPC 0x880696e0 ErrCtl 0x0 CacheErr 0xa42c77e3
db> bt
cpu_Debugger+4 (97fff000,d,0,bfbd9830) ra 8823cdbc sz 0
panic+190 (97fff000,880696e0,0,a42c77e3) ra 882c3278 sz 48
pmap_copy_page+b4 (97fff000,880696e0,0,a42c77e3) ra 881cb070 sz 40
uvmfault_promote+94 (97fff000,880696e0,ffffffff,a42c77e3) ra 881cc358 sz
64 uvm_fault_internal+d28 (96ceb2d8,1000,2,0) ra 882c56b8 sz 296
trap+684 (2000ff13,1000,2,7dea1ea0) ra 882beb44 sz 112
mips3_UserGenException+cc (2000ff13,1000,2,7dea1ea0) ra 0 sz 0
User-level: pid 831.1
db> ps
 PID     PPID     PGRP    UID S   FLAGS LWPS          COMMAND    WAIT
 16735  10000     3901   1006 2  0x4000    1            spamc   netio
 10000   3901     3901   1006 2   0x100    1         procmail    wait
 3901   26259     3901   1006 2  0x4100    1         procmail  piperd
 26259    361      361      0 2  0x4108    1            local    poll
 1229     361      361     12 2  0x4108    1          cleanup  kqread
 1960     361      361     12 2  0x4108    1  trivial-rewrite  kqread
 18082    361      361     12 2  0x4108    1         proxymap  kqread
 5078     361      361     12 2  0x4108    1            smtpd  kqread
 20014    361      361     12 2  0x4108    1           pickup  kqread
>831    18753    18753      0 2   0x100    1             perl
 15942  24126    15942   1006 2  0x4002    1          wish8.4  select
 24126  21598    24126   1006 2  0x4002    1             bash    wait
 21598      1     7549   1006 2  0x4002    1            xterm  select
 3655       1     3655      0 2       0    1            rarpd  select
 8678   16722     8678      0 2  0x4002    1             bash   ttyin
 16722  21360    16722      0 2  0x4002    1               sh    wait
 21360   5542    21360   1006 2  0x4002    1             bash    wait
 5542       1    23082   1006 2  0x4002    1            xterm  select
 26901   3985    26901   1006 2  0x4002    1             bash   ttyin
 3985    5022     3985      0 2  0x4102    1            login    wait
 5022     669      669      0 2  0x4000    1          telnetd    poll
 28628  18753    18753      0 2   0x100    1             perl  select
 27387  25797    25797   1000 2   0x100    1            httpd  netcon
 8951   25797    25797   1000 2   0x100    1            httpd  netcon
 8458   25797    25797   1000 2   0x100    1            httpd  netcon
 19596      1    19596      0 2       0    1            named  select
 3326   25797    25797   1000 2   0x100    1            httpd  netcon
 26162  25797    25797   1000 2   0x100    1            httpd  netcon
 10353  25797    25797   1000 2   0x100    1            httpd  netcon
 12748  25797    25797   1000 2   0x100    1            httpd  netcon
 2313   25797    25797   1000 2   0x100    1            httpd  netcon
 16064  25797    25797   1000 2   0x100    1            httpd  netcon
 29410  25797    25797   1000 2   0x100    1            httpd  netcon
 25797      1    25797      0 2       0    1            httpd  select
 18753      1    18753      0 2  0x4002    1             perl  select
 4616       1     9385   1006 2  0x4002    1         tclsh8.4  select
 2883       1    24611      0 2       0    1            snmpd  select
 8118       1     8118      0 2  0x4002    1            getty   ttyin
 755        0        0      0 2 0x20200    1            nfsio  nfsidl
 352        0        0      0 2 0x20200    1            nfsio  nfsidl
 356        0        0      0 2 0x20200    1            nfsio  nfsidl
 647        0        0      0 2 0x20200    1            nfsio  nfsidl
 227        1      227      0 2       0    1             cron nanosle
 225      361      361     12 2  0x4108    1             qmgr  kqread
 669        1      669      0 2       0    1            inetd  kqread
 361        1      361      0 2  0x4108    1           master  kqread
 502        1      502      0 2       0    1             sshd  select
 467        1      467      0 2       0    1             mopd    poll
 428        1      428      0 2       0    1            dhcpd  select
 404        1      404      0 2       0    1   rpc.bootparamd  select
 366      359      359      0 2       0    1             nfsd    nfsd
 363      359      359      0 2       0    1             nfsd    nfsd
 348      359      359      0 2       0    1             nfsd    nfsd
 346      359      359      0 2       0    1             nfsd    nfsd
 359        1      359      0 2       0    1             nfsd    poll
 328        1      328      0 2       0    1           mountd  select
 271        1      271      0 2       0    1          rpcbind    poll
 242        1      242      0 2       0    1          syslogd  kqread
 201        1      201      0 2       0    1           routed  select
 27         0        0      0 2 0x20200    1          physiod physiod
 5          0        0      0 2 0x20200    1         aiodoned aiodone
 4          0        0      0 2 0x20200    1          ioflush  syncer
 3          0        0      0 2 0x20200    1       pagedaemon pgdaemo
 2          0        0      0 2 0x20200    1         scsibus0  sccomp
 1          0        1      0 2  0x4001    1             init    wait
 0         -1        0      0 2 0x20200    1          swapper schedul
db> reboot
syncing disks... 12 tlp0: receive ring overrun
12 12 11 10 9 8 7 6 6 5 5 5 4 3 2 1 done
rebooting...

                           Starting up the system...

NetBSD/sgimips 4.0_BETA2 Bootstrap, Revision 1.2
(root%indy3.ceiridos.co.uk@localhost, Mon Mar 19 22:13:55 GMT 2007)

devopen: scsi(0)disk(2)rdisk(0)partition(0) type scsi file netbsd
3314416+192508 [187408+179406]=0x3b1f98
Found bootinfo at 0x8800d710
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004,
2005,    2006, 2007
    The NetBSD Foundation, Inc.  All rights reservd.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of he University of California.  All rights reservd.

NetBSD 4.0 (GENERIC32_IP2x) #0: Sun Dec 1 02:11:52 PST 2007

builds@wb41:/home/builds/ab/netbsd-4-0-RELEASE/sgimips/200712160005Z-ob
j/home/builds/ab/netbsd-4-0-RELEASE/src/sys/arch/sgimips/compile/GENERI
C32_IP2x total memory = 256 MB
(768 KB reserved for ARCS)
avail memory = 245 MB
timeconter: Timecounters tick every 10.000 msec
mainbus0 (root): SGI-IP22 [SGI, 690ac9fb], 1 processor
cpu0 at mainbus0: MIPS R5000 CPU (0x2310) Rev. 1.0 with built-in FPU
Rev. 1.0 cpu0: 32KB/32B 2-way set-associative L1 Instruction cache, 48
TLB etries cpu0: 32KB/32B 2-way set-associative wrie-back L1 Data cach
cpu0: 512KB/32B direct-mapped write-through L2 Data cache
ioc0 at mainbus0 addr 0x1fbd9800: rev 0, machine Indy (Guiness), board
rev 0 int0 at mainbus0 addr 0x1fbd9880: bus 90MHz, CPU 180MHz
imc0 at mainbus0 addr 0x1fa00000: revision 3
gio0 at imc0
giopci0 at gio0 slot 1 addr 0x1f400000: Phobos G130 10/100 Ethernet
pci0 at giopci0 bus 0
pci0: memory space enabled
tlp0 at pci0 dev 0 function 0: DECchip 21143 Ethernet, pass 4.1
tlp0: interrupting at slot EXP0
tlp0: Ethernet address 00:60:f5:08:23:07
lxtphy0 at tlp0 phy 1: LXT970 10/100 media interface, rev. 3
lxtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
Synchronous ISDN (product 0x04 revision 0x00) at gio0 slot 0 addr
0x1f000000 not configured hpc0 at gio0 addr 0x1fb80000: SGI HPC3
zsc0 at hpc0 offset 0x59830
zstty0 at zsc0 channel 1 (console i/o)
zstty1 at zsc0 channel 0
pckbc0 at hpc0 offset 0x59840
sq0 at hpc0 offset 0x54000: SGI Seeq 80c03
sq0: Ethernet address 08:00:69:0a:c9:fb
wdsc0 at hpc0 offset 0x44000: WD33C93B revision 0, 10.0 MHz, SCSI ID 0
scsibus0 at wdsc0: 8 targets, 8 luns per target
dsclock0 at hpc0 offset 0x60000
pi1ppc0 at hpc0 offset 0x58000
pi1ppc0: capabilities=8<PS2>
ppbus0 at pi1ppc0
ppbus0: No IEEE1284 device found.
pi1ppc at hpc0 offset 0x59800 not configured
hpc1 at gio0 addr 0x1fb00000: SGI HPC3
sq at hpc1 offset 0x100 not configured
biomask 07 netmask 07 ttymask 0f clockmask bf
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
timecounter: Timecounter "mips3_cp0_counter" frequency 90000000 Hz
quality 100 scsibus0: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 2 lun 0: <IBM, DNES-309170, SAH0> disk fixed
sd0: 8748 MB, 11474 cyl, 5 head, 312 sec, 512 bytes/sect x 17916240
sectors sd0: sync (200.00ns offset 12), 8-bit (5.000MB/s) transfers,
tagged queueing boot device: sd0
root on sd0a dumps on sd0b
root file system type: ffs
WARNING: clock gained 3 days
WARNING: CHECK AND RESET THE DATE!
Wed Mar 26 08:10:22 GMT 2008
swapctl: adding /dev/sd0b as swap device at priority 0
Checking for botched superblock upgrades:bus error: cpu_stat 00000380
addr 17ac77e0, gio_stat 00000000 addr 1fbc4003 panic: cache error @ EPC
0x88069740 ErrCtl 0x0 CacheErr 0xa42c77e3 Stopped in pid 26.1 (cat) at  
 netbsd:cpu_Debugger+0x4:        jr      ra                bdslot: nop
db> bt
cpu_Debugger+4 (97fff000,d,0,bfbd9830) ra 8823cdbc sz 0
panic+190 (97fff000,88069740,0,a42c77e3) ra 882c3ed4 sz 48
pmap_zero_page+bc (97fff000,88069740,0,a42c77e3) ra 881dabf4 sz 40
uvm_pagealloc_strat+1d4 (97fff000,88069740,0,0) ra 881cb058 sz 64
uvmfault_promote+7c (97fff000,88069740,ffffffff,0) ra 881cc5c8 sz 64
uvm_fault_internal+f98 (97dc5780,1000,1,0) ra 882c56b8 sz 296
trap+684 (ff13,1000,1,7df2fd2c) ra 882beb44 sz 112
mips3_UserGenException+cc (ff13,1000,1,7df2fd2c) ra 0 sz 0
User-level: pid 26.1
db> ps
 PID     PPID     PGRP    UID S   FLAGS LWPS          COMMAND    WAIT
 27         0        0      0 2 0x20200    1          physiod physiod
>26        23        6      0 2  0x4002    1              cat
 25        23        6      0 2  0x4002    1               dd
 24        23        6      0 2  0x4002    1               dd  pipdwt
 23        22        6      0 2     0x2    1               sh    wait
 22        16        6      0 2     0x2    1               sh    wait
 16         6        6      0 2     0x2    1               sh  piperd
 6          1        6      0 2  0x4002    1               sh    wait
 5          0        0      0 2 0x20200    1         aiodoned aiodone
 4          0        0      0 2 0x20200    1          ioflush  syncer
 3          0        0      0 2 0x20200    1       pagedaemon pgdaemo
 2          0        0      0 2 0x20200    1         scsibus0  sccomp
 1          0        1      0 2  0x4000    1             init    wait
 0         -1        0      0 2 0x20200    1          swapper schedul
db> 



Home | Main Index | Thread Index | Old Index