Subject: Re: NetBSD 1.5Y of Oct 3 panics
To: =?iso-8859-1?Q?Staffan_Thom=E9n?= <duck@mult.fi>
From: Chuck Silvers <chuq@chuq.com>
List: port-alpha
Date: 10/16/2001 23:33:21
hi,

the first panic has been reported by a couple other people now too,
but I haven't had a chance to look at it yet.

the stack trace in the second panic looks pretty messed up,
since ltsleep() doesn't call uvm_aio_aiodone(),
but there was a bug with that panic message that was fixed last week,
so updating your kernel wouldn't be a bad idea.

-Chuck


On Tue, Oct 16, 2001 at 11:25:47AM +0300, Staffan Thomén wrote:
>  Hi,
>   My AlphaPC164 has gotten into the bad habit of panicing after my update to
>   1.5Y on the 3:rd of October. It has done so twice under some (not idle) 
>   network/filesystem load.
> 
>   I didn't manage to get a dump for some reason the first time, but these
>   are the messages I caught in dmesg.
> 
> --- 8< ---
> 
> panic: kernel diagnostic assertion "pcg->pcg_objects[idx] == NULL" failed:
> file "../../../../kern/subr_pool.c", line 1646
> 
> syncing disks... 18 18 18 18 18 18 18 18 18 18 18 18 18 18 18 18 18 18 18 18
> giving up
> 
> dumping to dev 8,1 offset 536001
> dump i/o error
> 
> 
> sd0(ahc0:0:2:0): polling command not done
> panic: scsipi_execute_xs
> 
> dumping to dev 8,1 offset 536001
> dump device not ready
> 
> --- 8< ---
>  
>   On the second occasion, I had the presence of mind to do a trace, and also
>   received a dump. Tell me if you want it or what you want me to do with it.
> 
>   Trace as follows
> 
> --- 8< ---
> 
> panic: softdep_pageiodone: resid < 0, vp 0xfffffc0007a299a0 lbn 0x0 pcbp 0xfffffc0003b82b50
> Stopped in pid 147 (nfsd) at   cpu_Debugger+0x4:        ret     zero,(ra)
> db> t
> cpu_Debugger() at cpu_Debugger+0x4
> panic() at panic+0x15c
> softdep_pageiodone() at softdep_pageiodone+0x188
> uvm_aio_aiodone() at uvm_aio_aiodone+0xe4
> ltsleep() at ltsleep+0x318
> genfs_gop_write() at genfs_gop_write+0x454
> *ABS*() at      0x4
> 
> CPU 0: fatal kernel trap:
> 
> CPU 0   trap entry      = 0x2 (memory management fault)
> CPU 0   a0              = 0x0
> CPU 0   a1              = 0x1
> CPU 0   a2              = 0x0
> CPU 0   pc              = 0xfffffc00004e54f0
> CPU 0   ra              = 0xfffffc00004e5410
> CPU 0   pv              = 0xfffffc0000325500
> CPU 0   curproc         = 0xfffffc00034b6b68
> CPU 0           pid = 147, comm = nfsd
> 
> Caught exception in ddb.
> 
> db> sync
> syncing disks... ex0: splistptr was 0
> panic lockmgr: locking against myself
> Stopped in pid 147 (nfsd) at    cpu_Debugger+0x4:       ret     zero,(ra)
> db> sync
> 
> dumping to dev 8,1 offset 536001
> [...]
> 
> --- 8< ---
> 
>   If you want, I can make the dump available if anyone wishes to have a look,
>   or tell me what to do.
>   The box has worked *perfectly* up til these two occasions.
> 
>  /Staffan
> 
> --
> -- Staffan Thomén (0x00C37E1F) ---------------------------------------------
> Totally illogical, there was no chance.
> 		-- Spock, "The Galileo Seven", stardate 2822.3