Subject: Re: CURRENT build crash
To: None <port-sgimips@NetBSD.org>
From: david l goodrich <dlg@dorkzilla.org>
List: port-sgimips
Date: 08/22/2004 14:20:36
I built a debugging kernel, and was finally able to make the machine crash
running `./build.sh -u -m sgimips build` ... the debugging kernel was
surprisingly robust...
but i still get
Checking for core dump...
savecore: no core dump
when the machine reboots...
the only change i made in the kernel conf to make this kernel is to uncomment
makeoptions DEBUG="-g" # compile full symbol table
is there something more i need to do?
--david
# objdir /usr/obj/lib/csu/mips
obj ===> lib/libc
trap: address error (load or I-fetch) in kernel mode
status=0xff03, cause=0x10, epc=0x8023f4bc, vaddr=0xc37c437f
pid=5690 cmd=nbmake usp=0x7fffdcc8 ksp=0xc52dd9c8
Stopped in pid 5690.1 (nbmake) at 0x8023f4bc: lhu s2,4(s1)
db> trace
8023f318+1a4 (1fff,2,0,0) ra 8023f47c sz 0
8023f318+164 (1fff,2,0,0) ra 0 sz 0
User-level: pid 5690.1
db> ps
PID PPID PGRP UID S FLAGS LWPS COMMAND WAIT
>>5690 2249 434 0 2 0x4002 1 nbmake
2249 7061 434 0 2 0x4002 1 sh wait
7061 6227 434 0 2 0x4002 1 nbmake wait
6227 3319 434 0 2 0x4002 1 sh wait
3319 3343 434 0 2 0x4002 1 nbmake wait
3343 3480 434 0 2 0x4002 1 sh wait
3480 434 434 0 2 0x4002 1 nbmake wait
434 333 434 0 2 0x4002 1 sh wait
415 0 0 0 2 0x20200 1 nfsio nfsidl
411 0 0 0 2 0x20200 1 nfsio nfsidl
413 0 0 0 2 0x20200 1 nfsio nfsidl
400 0 0 0 2 0x20200 1 nfsio nfsidl
333 1 333 0 2 0x4003 1 bash wait
337 1 337 0 2 0 1 cron nanosle
330 1 330 0 2 0 1 inetd kqread
286 1 286 0 2 0x100 1 sendmail select
276 1 276 0 2 0 1 sshd select
150 1 150 0 2 0 1 syslogd
8 0 0 0 2 0x20200 1 aiodoned aiodone
7 0 0 0 2 0x20200 1 ioflush syncer
6 0 0 0 2 0x20200 1 pagedaemon pgdaemo
5 0 0 0 2 0x20200 1 lfs_writer lfswrit
4 0 0 0 2 0x20200 1 scsibus1 sccomp
3 0 0 0 2 0x20200 1 scsibus0 sccomp
2 0 0 0 2 0x20200 1 cryptoret crypto_
1 0 1 0 2 0x4000 1 init wait
0 -1 0 0 2 0x20200 1 swapper schedul
db> t
8023f318+1a4 (1fff,2,0,0) ra 8023f47c sz 0
8023f318+164 (1fff,2,0,0) ra 0 sz 0
User-level: pid 5690.1
db> dmesg
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004
The NetBSD Foundation, Inc. All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
NetBSD 2.0_BETA (DEBUG32_IP3x) #1: Sun Aug 15 12:58:06 EDT 2004
root@neptune:/usr/src/sys/arch/sgimips/compile/DEBUG32_IP3x
total memory = 128 MB
(6848 KB reserved for ARCS)
avail memory = 113 MB
mainbus0 (root): SGI-IP32 [SGI, 8], 1 processor
cpu0 at mainbus0: MIPS R10000 CPU (0x926) Rev. 2.6 with built-in FPU Rev. 0.0
cpu0: 32KB/64B 2-way set-associative L1 Instruction cache, 64 TLB entries
cpu0: 32KB/32B 2-way set-associative write-back L1 Data cache
cpu0: 1024KB/64B 2-way set-associative write-back L2 Data cache
crime0 at mainbus0 addr 0x14000000: rev 1.1 (CRIME_ID: a1)
mace0 at mainbus0 addr 0x1f000000
com0 at mace0 offset 0x390000 intr 4 intrmask 0x3f00000: ns16550a, working fifo
com0: console
mace: established interrupt 4 (level 3f00000)
com1 at mace0 offset 0x398000 intr 4 intrmask 0xfc000000: ns16550a, working fifo
mace: established interrupt 4 (level fc000000)
pckbc0 at mace0 offset 0x320000 intr 5 intrmask 0x0
mcclock0 at mace0 offset 0x3a0000 intrmask 0x0
mec0 at mace0 offset 0x280000 intr 3 intrmask 0x0: MAC-110 Ethernet, rev 1
mec0: Ethernet address 08:00:69:0c:2b:78
nsphy0 at mec0 phy 8: DP83840 10/100 media interface, rev. 1
nsphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
mace: established interrupt 3 (level 0)
macepci0 at mace0 offset 0x80000 intr 7 intrmask 0x0: rev 1
pci_addr_fixup: 000:01:0 0x9004 0x8078 new address 0x00001000 (size 0x100)
pci_addr_fixup: 000:01:0 0x9004 0x8078 new address 0x80100000 (size 0x1000)
pci_addr_fixup: 000:02:0 0x9004 0x8078 new address 0x00002000 (size 0x100)
pci_addr_fixup: 000:02:0 0x9004 0x8078 new address 0x80200000 (size 0x1000)
mace: established interrupt 7 (level 0)
pci0 at macepci0 bus 0
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
ahc0 at pci0 dev 1 function 0: Adaptec aic7880 Ultra SCSI adapter
ahc0: interrupting at crime interrupt 8
ahc0: Using left over BIOS settings
ahc0: aic7880: Wide Channel A, SCSI Id=0, 16/253 SCBs
scsibus0 at ahc0: 16 targets, 8 luns per target
ahc1 at pci0 dev 2 function 0: Adaptec aic7880 Ultra SCSI adapter
ahc1: interrupting at crime interrupt 9
ahc1: Using left over BIOS settings
ahc1: aic7880: Wide Channel A, SCSI Id=0, 16/253 SCBs
scsibus1 at ahc1: 16 targets, 8 luns per target
biomask 07 netmask 07 ttymask 07 clockmask 87
scsibus0: waiting 2 seconds for devices to settle...
scsibus1: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 2 lun 0: <SGI, IBM DCAS-32160W, S62A> disk fixed
sd0: 2049 MB, 8188 cyl, 3 head, 170 sec, 512 bytes/sect x 4197405 sectors
sd0: async, 8-bit transfers, tagged queueing
cd0 at scsibus0 target 4 lun 0: <TOSHIBA, CD-ROM XM-5701TA, 0167> cdrom removable
cd0: async, 8-bit transfers
cd0(ahc0:0:4:0): Check Condition on CDB: 0x00 00 00 00 00 00
SENSE KEY: Media Error
ASC/ASCQ: Unable To Recover Table-Of-Contents
boot device: sd0
root on sd0a dumps on sd0b
root file system type: ffs
trap: address error (load or I-fetch) in kernel mode
status=0xff03, cause=0x10, epc=0x8023f4bc, vaddr=0xc37c437f
pid=5690 cmd=nbmake usp=0x7fffdcc8 ksp=0xc52dd9c8
db>
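
For reference, the trap registers in the log above can be decoded by hand. This is a minimal Python sketch, assuming the standard MIPS CP0 Cause register layout (ExcCode in bits 6..2). The helper names `exc_code` and `is_aligned` are made up for illustration and are not part of any NetBSD tool. It only restates what the kernel already printed: an address error on a load, with a fault address that is not halfword aligned for the `lhu` at epc.

```python
# Decode the trap registers printed by the kernel in the log above.
# Assumption: standard MIPS CP0 Cause layout, ExcCode in bits 6..2.

EXC_CODES = {
    0: "Int (interrupt)",
    1: "Mod (TLB modification)",
    2: "TLBL (TLB miss on load or I-fetch)",
    3: "TLBS (TLB miss on store)",
    4: "AdEL (address error on load or I-fetch)",
    5: "AdES (address error on store)",
    6: "IBE (bus error on I-fetch)",
    7: "DBE (bus error on data access)",
}


def exc_code(cause):
    """Extract the ExcCode field (bits 6..2) from a Cause register value."""
    return (cause >> 2) & 0x1F


def is_aligned(addr, size):
    """True if addr is a multiple of the access size in bytes."""
    return addr % size == 0


# Values copied from the trap message in the log.
cause = 0x10
vaddr = 0xC37C437F

code = exc_code(cause)
print("ExcCode:", code, EXC_CODES.get(code, "other"))
# lhu is a 2-byte load, so the address must be a multiple of 2.
print("vaddr halfword aligned:", is_aligned(vaddr, 2))
```

The faulting instruction is `lhu s2,4(s1)`, so `s1` would have held `vaddr - 4`, which is also odd. That suggests the base pointer itself was bad, not just the offset.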
On Sun, Aug 15, 2004 at 11:15:32AM -0400, david l goodrich wrote:
> Did it again last night. But since this is a known error, is further debugging
> information of any value to you?
> --david
>
> On Sun, 15 Aug 2004 20:47:35 +0900
> Christopher SEKIYA <wileyc@rezrov.net> wrote:
>
>> On Sun, Aug 15, 2004 at 12:51:24AM -0400, david l goodrich wrote:
>>
>>> crime: memory error address 27bc8c00 status 2040000
>>
>> It's a r10k machine. You'll unfortunately have to expect random crashes from
>> time to time -- until we work around the brain-damaged IP32 non-coherent
>> cache architecture.
>>
>> (it's on my list, along with mips64, X for grtwo, and an IP32 ahc fix)
>>
>> -- Chris
>> GPG key FEB9DE7F (91AF 4534 4529 4BCC 31A5 938E 023E EEFB FEB9 DE7F)