Subject: Re: Recent (since 1.6E?) sparc64 instability?
To: Martin Husemann <martin@duskware.de>
From: Rafal Boni <rafal@attbi.com>
List: port-sparc64
Date: 11/13/2002 09:50:56
In message <200211112142.gABLgTi07173@fearless-vampire-killer.waterside.net>, 
I had written:

-> I've now got conserver set up on one of my systems, so all serial output on
-> the U5 should now be logged; maybe this will at least produce interesting output
-> next time this happens...

This morning the machine was down once again, and this time I have the
console logs, plus a trace of my 30-second debugging attempt, which I'll
attach below. (I still need to figure out what it all means, as I know
~0 about the SPARC or UltraSPARC chips.)

The current PC doesn't look sensible, and I'm not sure which of the
trap register sets are the ones to look at, but I can provide the nm
output for the running kernel (or the function names extrapolated from
it) if need be. I thought I'd post this in case it rings any bells for
the experts; if not, I'll try to make some sense of it myself, or with
the help of some clues from the ether 8-)
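
For anyone who wants to match the %TPC values below against nm output,
here is a rough sketch of the lookup, assuming "nm -n netbsd" style
lines of the form "<hex address> <type> <name>". The symbol names in the
sample are made up for illustration and are not taken from the real
kernel; parse_nm and lookup are hypothetical helper names.

```python
import bisect

# Hypothetical sample data: NOT real NetBSD kernel symbols.
SAMPLE_NM = """\
0000000001005400 T hypothetical_handler_a
0000000001008700 T hypothetical_handler_b
0000000001009200 T hypothetical_handler_c
                 U hypothetical_undefined
"""

def parse_nm(text):
    """Return sorted (address, name) pairs from nm-style text."""
    syms = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # skip undefined symbols and anything unexpected
        try:
            addr = int(parts[0], 16)
        except ValueError:
            continue
        syms.append((addr, parts[2]))
    syms.sort()
    return syms

def lookup(syms, addr):
    """Return (name, offset) of the nearest symbol at or below addr."""
    addrs = [a for a, _ in syms]
    i = bisect.bisect_right(addrs, addr) - 1
    if i < 0:
        return None  # address is below the first symbol
    base, name = syms[i]
    return name, addr - base

if __name__ == "__main__":
    syms = parse_nm(SAMPLE_NM)
    for tpc in (0x1009204, 0x1005804, 0x100870c):
        print(hex(tpc), lookup(syms, tpc))
```

Note that this only finds the closest preceding symbol; an address far
past the end of a function will still be attributed to it.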

--rafal


Watchdog Reset
Externally Initiated Reset
ok 
ok ctrace 
PC: fffffffff00025b0 
Last leaf: Fast Data Access MMU Miss
ok .registers
        Normal          Alternate       MMU               Vector
0:                 0                0                0                0
1:        9911041407 504f535400000000       4482008207          1805260
2:           10081a8                0                0              7e0
3:           10081ac           141039 fffffffff000258c                0
4:                 0 fffffffffffffffe                7         fffe7b48
5:                 0          1000000               40          1ecf900
6:                 0 fffffffff00025a8           10f784               20
7:                 3 fffffffff0002350 fffffffff0002560                1
%PC  fffffffff00025b0 %nPC fffffffff00025b0 
%TBA 1000000 %CCR 80 XCC:Nzvc   ICC:nzvc
ok .trap-registers 
%TL:1 %TT:3 %TPC:fffffffff00025b0 %TnPC:fffffffff00025b0 
%TSTATE:4415043400  %CWP:0 
   %PSTATE:434 AG:0 IE:0 PRIV:1 AM:0 PEF:1 RED:1 MM:0 TLE:0 CLE:0 MG:1 IG:0 
   %ASI:15  %CCR:44  XCC:nZvc   ICC:nZvc

%TL:2 %TT:c8 %TPC:1009204 %TnPC:1009208 
%TSTATE:4482000500  %CWP:0 
   %PSTATE:5 AG:1 IE:0 PRIV:1 AM:0 PEF:0 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0 
   %ASI:82  %CCR:44  XCC:nZvc   ICC:nZvc

%TL:3 %TT:68 %TPC:1005804 %TnPC:1005808 
%TSTATE:11001507  %CWP:7 
   %PSTATE:15 AG:1 IE:0 PRIV:1 AM:0 PEF:1 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0 
   %ASI:11  %CCR:0  XCC:nzvc   ICC:nzvc

%TL:4 %TT:a8 %TPC:100870c %TnPC:1008710 
%TSTATE:4458000400  %CWP:0 
   %PSTATE:4 AG:0 IE:0 PRIV:1 AM:0 PEF:0 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0 
   %ASI:58  %CCR:44  XCC:nZvc   ICC:nZvc

%TL:5 %TT:68 %TPC:1005404 %TnPC:1005408 
%TSTATE:11001507  %CWP:7 
   %PSTATE:15 AG:1 IE:0 PRIV:1 AM:0 PEF:1 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0 
   %ASI:11  %CCR:0  XCC:nzvc   ICC:nzvc

ok .pstate 
AG:0 IE:1 PRIV:1 AM:0 PEF:1 RED:0 MM:0 TLE:0 CLE:0 MG:0 IG:0 
ok boot
Resetting ... 


Sun Ultra 5/10 UPA/PCI (UltraSPARC-IIi 360MHz), No Keyboard
OpenBoot 3.25, 128 MB (50 ns) memory installed, Serial #xxxxxxxx.
Ethernet address 8:0:20:xx:xx:xx, Host ID: 80xxxxxx.



Initializing Memory
Boot device: disk:a  File and args: 
NetBSD IEEE 1275 Bootblock
..>> NetBSD/sparc64 OpenFirmware Boot, Revision 1.5
>> (chs@ultra2, Sun Dec 23 01:58:13 PST 2001)
devopen: getdisklabel sez no disk label
devopen: search_label sez no disk label
loadfile: reading header
elf64_exec: Booting /pci@1f,0/pci@1,1/ide@3/disk@0,0:a/netbsd
1948784@0x1000000/92960@0x1800000+4101344@0x1816b20 
symbols @ 0xfff18300 74+186768+101096/start=0x1000000
chain: calling OF_chain(800000, ee08, 1000000, fffa9a80, 18)
[ using 288712 bytes of netbsd ELF symbol table ]
console is /pci@1f,0/pci@1,1/ebus@1/se@14,400000:b
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 1.6I (FEARLESS_VAMPIRE_KILLER) #6: Tue Oct 15 01:38:45 EDT 2002
    rafal@:/extra/src-current/sys/arch/sparc64/compile/FEARLESS_VAMPIRE_KILLER
total memory = 128 MB
avail memory = 111 MB
using 832 buffers containing 6656 KB of memory
bootpath: /pci@1f,0/pci@1,1/ide@3,0/disk@0,0
mainbus0 (root): SUNW,Ultra-5_10
cpu0 at mainbus0: SUNW,UltraSPARC-IIi @ 360 MHz, version 0 FPU
cpu0: 32K instruction (32 b/l), 16K data (32 b/l), 256K external (64 b/l)
psycho0 at mainbus0 addr 0xfffc4000
SUNW,sabre: impl 0, version 0: ign 7c0 bus range 0 to 2; PCI bus 0
DVMA map: c0000000 to e0000000
IOTSB: 10a42000 to 10ac2000
pci0 at psycho0
pci0: i/o space, memory space enabled
ppb0 at pci0 dev 1 function 1: Sun Microsystems Simba PCI bridge (rev. 0x13)
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled
ebus0 at pci1 dev 1 function 0
ebus0: Sun Microsystems PCIO Ebus2, revision 0x01
auxio0 at ebus0 addr 726000-726003, 728000-728003, 72a000-72a003, 72c000-72c003, 72f000-72f003
power at ebus0 addr 724000-724003 ipl 37 not configured
SUNW,pll at ebus0 addr 504000-504002 not configured
sab0 at ebus0 addr 400000-40007f ipl 43: rev 3.2
sabtty0 at sab0 port 0
sabtty1 at sab0 port 1: console i/o
com0 at ebus0 addr 3083f8-3083ff ipl 41: ns16550a, working fifo
kbd0 at com0
com1 at ebus0 addr 3062f8-3062ff ipl 42: ns16550a, working fifo
ms0 at com1
lpt0 at ebus0 addr 3043bc-3043cb, 30015c-30015d, 700000-70000f ipl 34
fdthree at ebus0 addr 3023f0-3023f7, 706000-70600f, 720000-720003 ipl 39 not configured
clock0 at ebus0 addr 0-1fff: mk48t59: hostid xxxxxxxx
flashprom at ebus0 addr 0-fffff not configured
audiocs0 at ebus0 addr 200000-2000ff, 702000-70200f, 704000-70400f, 722000-722003 ipl 35 ipl 36: CS4231A
audio0 at audiocs0: full duplex
hme0 at pci1 dev 1 function 1: Sun Happy Meal Ethernet, rev. 1
hme0: interrupting at ivec 3021
hme0: Ethernet address 08:00:20:xx:xx:xx
nsphy0 at hme0 phy 1: DP83840 10/100 media interface, rev. 1
nsphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pciide0 at pci1 dev 3 function 0: CMD Technology PCI0646 (rev. 0x03)
pciide0: bus-master DMA support present
pciide0: primary channel configured to native-PCI mode
pciide0: using ivec 1820 for native-PCI interrupt
wd0 at pciide0 channel 0 drive 0: <ST38410A>
wd0: drive supports 32-sector PIO transfers, LBA addressing
wd0: 8223 MB, 16708 cyl, 16 head, 63 sec, 512 bytes/sect x 16841664 sectors
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 4 (Ultra/66)
wd0(pciide0:0:0): using PIO mode 4, DMA mode 2 (using DMA data transfers)
pciide0: secondary channel configured to native-PCI mode
atapibus0 at pciide0 channel 1: 2 targets
cd0 at atapibus0 drive 0: <CRD-8322B, 1998/09/24, 1.05> cdrom removable
cd0: drive supports PIO mode 4, DMA mode 2
cd0(pciide0:1:0): using PIO mode 4, DMA mode 2 (using DMA data transfers)
ppb1 at pci0 dev 1 function 0: Sun Microsystems Simba PCI bridge (rev. 0x13)
pci2 at ppb1 bus 2
pci2: i/o space, memory space enabled
3D Labs GLINT Permedia 2V (miscellaneous display, revision 0x01) at pci2 dev 1 function 0 not configured
pcons at mainbus0 not configured
No counter-timer -- using %tick at 360MHz as system clock.
IPsec: Initialized Security Association Processing.
root on wd0a dumps on wd0b
root file system type: ffs
Wed Nov 13 14:31:24 GMT 2002
swapctl: adding /dev/wd0b as swap device at priority 0
Starting file system checks:
[...]
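
As a cross-check on the .trap-registers output above, here is a small
sketch that splits a %TSTATE value into its fields and names the %TT
codes. It assumes the SPARC V9 layout (CCR in bits 39:32, ASI in 31:24,
PSTATE in 19:8, CWP in 4:0) with the UltraSPARC PSTATE bits, and covers
only the trap types seen in this dump plus their neighbours; the
function names are my own.

```python
# PSTATE single-bit flags (name, bit position); MM is a 2-bit field at 7:6.
PSTATE_BITS = [
    ("AG", 0), ("IE", 1), ("PRIV", 2), ("AM", 3), ("PEF", 4), ("RED", 5),
    ("TLE", 8), ("CLE", 9), ("MG", 10), ("IG", 11),
]

RESET_TRAPS = {
    0x001: "power_on_reset",
    0x002: "watchdog_reset",
    0x003: "externally_initiated_reset",
    0x004: "software_initiated_reset",
    0x005: "RED_state_exception",
}

# Register-window traps: 8 vectors per group, 4 trap-table slots each.
WINDOW_TRAPS = [
    (0x080, "spill_{}_normal"),
    (0x0A0, "spill_{}_other"),
    (0x0C0, "fill_{}_normal"),
    (0x0E0, "fill_{}_other"),
]

def decode_tstate(tstate):
    """Split a %TSTATE value into CCR, ASI, PSTATE and CWP."""
    return {
        "ccr": (tstate >> 32) & 0xFF,
        "asi": (tstate >> 24) & 0xFF,
        "pstate": (tstate >> 8) & 0xFFF,
        "cwp": tstate & 0x1F,
    }

def pstate_flags(pstate):
    """Return a dict of PSTATE flag values, as printed by .pstate."""
    flags = {name: (pstate >> bit) & 1 for name, bit in PSTATE_BITS}
    flags["MM"] = (pstate >> 6) & 0x3
    return flags

def trap_name(tt):
    """Name a %TT value; only a subset of the trap table is covered."""
    if tt in RESET_TRAPS:
        return RESET_TRAPS[tt]
    if 0x064 <= tt <= 0x067:
        return "fast_instruction_access_MMU_miss"
    if 0x068 <= tt <= 0x06B:
        return "fast_data_access_MMU_miss"
    if 0x06C <= tt <= 0x06F:
        return "fast_data_access_protection"
    for base, fmt in WINDOW_TRAPS:
        if base <= tt < base + 0x20:
            return fmt.format((tt - base) // 4)
    return "not covered by this sketch (0x%x)" % tt

if __name__ == "__main__":
    # (%TL, %TT, %TSTATE) triples copied from the dump above.
    for tl, tt, ts in [(1, 0x3, 0x4415043400), (2, 0xc8, 0x4482000500),
                       (3, 0x68, 0x11001507), (4, 0xa8, 0x4458000400),
                       (5, 0x68, 0x11001507)]:
        f = decode_tstate(ts)
        print("TL", tl, trap_name(tt), {k: hex(v) for k, v in f.items()})
```

The decoded fields agree with what OpenBoot prints next to each %TSTATE
(for example 4415043400 gives PSTATE 434, ASI 15, CCR 44), so the sketch
adds no new information; it is only a way to check the layout when
reading such dumps offline.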

----
Rafal Boni                                                     rafal@attbi.com
  We are all worms.  But I do believe I am a glowworm.  -- Winston Churchill