Subject: RE: netbsd autoreboot problem on SGI O2
To: Manuel Bouyer <bouyer@antioche.eu.org>
From: Christopher Padwick <cpadwick@ittvis.com>
List: port-sgimips
Date: 11/13/2006 21:46:19

Hi Manuel,

Here is the stack trace:

db> trace
8036c4b4+214 (8ffff000,bf390000,0,d) ra 802bb69c sz 0
802bb50c+190 (8ffff000,d,0,72) ra 80250b00 sz 40
80250af4+c (8ffff000,d,0,72) ra 0 sz 0
User-level: pid 72.1

Anything useful in there?  I've also included the output of dmesg below.
I notice that it's panicking about a "bad dir ino" down at the bottom.
This is new; it wasn't doing this before.  Is there any way to repair that?

db> dmesg
Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005
    The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 3.0.2 (GENERIC32_IP3x) #0: Wed Nov  1 06:04:52 UTC 2006
        builds@b0.netbsd.org:/home/builds/ab/netbsd-3-0-2-RELEASE/sgimips/200610311952Z-obj/home/builds/ab/netbsd-3-0-2-RELEASE/src/sys/arch/sgimips/compile/GENERIC32_IP3x
total memory = 256 MB
(6848 KB reserved for ARCS)
avail memory = 238 MB
mainbus0 (root): SGI-IP32 [SGI, 0], 1 processor
cpu0 at mainbus0: MIPS R5000 CPU (0x2321) Rev. 2.1 with built-in FPU Rev. 1.0
cpu0: 32KB/32B 2-way set-associative L1 Instruction cache, 48 TLB entries
cpu0: 32KB/32B 2-way set-associative write-back L1 Data cache
cpu0: 512KB/32B direct-mapped write-through L2 Unified cache
crime0 at mainbus0 addr 0x14000000: rev 1.1 (CRIME_ID: a1)
mace0 at mainbus0 addr 0x1f000000
lpt0 at mace0 offset 0x380000 intr 4 intrmask 0xf0000
mace: established interrupt 4 (level f0000)
com0 at mace0 offset 0x390000 intr 4 intrmask 0x3f00000: ns16550a, working fifo
com0: console
mace: established interrupt 4 (level 3f00000)
com1 at mace0 offset 0x398000 intr 4 intrmask 0xfc000000: ns16550a, working fifo
mace: established interrupt 4 (level fc000000)
pckbc0 at mace0 offset 0x320000 intr 5 intrmask 0x0
mcclock0 at mace0 offset 0x3a0000 intrmask 0x0
mec0 at mace0 offset 0x280000 intr 3 intrmask 0x0: MAC-110 Ethernet, rev 1
mec0: Ethernet address 08:00:69:0c:58:80
nsphy0 at mec0 phy 8: DP83840 10/100 media interface, rev. 1
nsphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
mace: established interrupt 3 (level 0)
macepci0 at mace0 offset 0x80000 intr 7 intrmask 0x0: rev 1
pci_addr_fixup: 000:01:0 0x9004 0x8078 new address 0x00001000 (size 0x100)
pci_addr_fixup: 000:01:0 0x9004 0x8078 new address 0x80100000 (size 0x1000)
pci_addr_fixup: 000:02:0 0x9004 0x8078 new address 0x00002000 (size 0x100)
pci_addr_fixup: 000:02:0 0x9004 0x8078 new address 0x80200000 (size 0x1000)
mace: established interrupt 7 (level 0)
pci0 at macepci0 bus 0
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
ahc0 at pci0 dev 1 function 0: Adaptec aic7880 Ultra SCSI adapter
mace: established interrupt 8 (level 0)
ahc0: interrupting at crime interrupt 8
ahc0: Using left over BIOS settings
ahc0: aic7880: Wide Channel A, SCSI Id=0, 16/253 SCBs
scsibus0 at ahc0: 16 targets, 8 luns per target
ahc1 at pci0 dev 2 function 0: Adaptec aic7880 Ultra SCSI adapter
mace: established interrupt 9 (level 0)
ahc1: interrupting at crime interrupt 9
ahc1: Using left over BIOS settings
ahc1: aic7880: Wide Channel A, SCSI Id=0, 16/253 SCBs
scsibus1 at ahc1: 16 targets, 8 luns per target
biomask 07 netmask 07 ttymask 07 clockmask 87
scsibus0: waiting 2 seconds for devices to settle...
scsibus1: waiting 2 seconds for devices to settle...
sd0 at scsibus0 target 1 lun 0: <SGI, IBM  DCHS04Y, 3030> disk fixed
sd0: 4340 MB, 6077 cyl, 9 head, 162 sec, 512 bytes/sect x 8888543 sectors
sd0: async, 8-bit transfers, tagged queueing
cd0 at scsibus0 target 4 lun 0: <TOSHIBA, CD-ROM XM-5701TA, 0167> cdrom removable
cd0: async, 8-bit transfers
boot device: sd0
root on sd0a dumps on sd0b
root file system type: ffs
/: bad dir ino 393163 at offset 512: mangled entry
panic: bad dir


-----Original Message-----
From: Manuel Bouyer [mailto:bouyer@antioche.eu.org]
Sent: Mon 11/13/2006 12:48 PM
To: Christopher Padwick
Cc: port-sgimips@NetBSD.org
Subject: Re: netbsd autoreboot problem on SGI O2

On Mon, Nov 13, 2006 at 08:08:58AM -0700, Christopher Padwick wrote:
> Hi Manuel,
>
> Thanks for the instructions.  The trace command says curlwp NULL,
> which I assume means that the kernel is crashing on a null pointer.

This may be normal if it's in interrupt context. Please provide the
stack trace if you see this again.

--
Manuel Bouyer <bouyer@antioche.eu.org>
     NetBSD: 26 years of experience will always make the difference
--


