Subject: kern/27139: panic while making world on NetBSD/Alpha 2.0_RC2 (UP1000)
To: None <gnats-bugs@gnats.NetBSD.org>
From: None <hhako@seagreen.ocn.ne.jp>
List: netbsd-bugs
Date: 10/04/2004 11:44:12
>Number:         27139
>Category:       kern
>Synopsis:       panic while making world on NetBSD/Alpha 2.0_RC2 (UP1000)
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    kern-bug-people
>State:          open
>Class:          sw-bug
>Submitter-Id:   net
>Arrival-Date:   Mon Oct 04 11:45:01 UTC 2004
>Closed-Date:
>Last-Modified:
>Originator:     Hiroshi Hakoyama
>Release:        NetBSD 2.0_RC2  Oct 4, 2004
>Organization:
NRIFS
>Environment:
NetBSD eco-math20.nrifs.affrc.go.jp 2.0_RC2 NetBSD 2.0_RC2 (UP1000) #0: Mon Oct  4 15:35:24 JST 2004  hako@eco-math20.nrifs.affrc.go.jp:/usr/src/sys/arch/alpha/compile/UP1000 alpha

>Description:
NetBSD 2.0_RC2 (and RC1) basically works on the API UP1000 (Alpha).
However, whenever I try to build the userland, the kernel panics.
The panics seem to occur under heavy load.
The panic message differs from one attempt to the next.

I also sometimes see the following message in dmesg:
"/netbsd: Warning: received processor correctable error.", which does not appear under NetBSD 1.6.2. I checked the memory with /usr/pkg/bin/memtester, but the UP1000 passed every test I ran.

In all cases I could not get a crash dump, because the machine hung while syncing the disks.
Below are two panic messages, transcribed by hand from the monitor.

panic: trap
Stopped in pid 18488.1 (rm) at netbsd: cpu_Debugger+0x4: ret zero, (ra)
db> trace
cpu_Debugger() at netbsd: cpu_Debugger+0x4
panic() at netbsd: panic +0x1f8
trap() at netbsd: trap +0x128
XentMM() at netbsd: XentMM +0x20
--- memory management fault (from ipl 0) ---
genfs_putpages() at netbsd: genfs_putpages + 0x338
db> sync
syncing disks ... tlp0: receive ring overrun
panic: lockmgr: locking against myself
Stopped in pid 18488.1 (rm) at netbsd: cpu_Debugger +0x4: ret zero, (ra)
db> reboot
panic: wdc_exec_command: polled command not done
Stopped in pid 18488.1 (rm) at netbsd: cpu_Debugger + 0x4: ret zero, (ra)


panic: pool_get (vmsppl): free list modified: 
magic = 17810e0; page 0xfffffc0020c3c000;
item addr 0xfffffc00020c3d588
Stopped in pid 6301.1 (sh) at netbsd:
db> trace
cpu_Debugger() at netbsd: cpu_Debugger+0x4
panic() at netbsd: panic + 0x1d8
pool_get() at netbsd: pool_get + 0x3ec
uvmspace_alloc() at netbsd: uvmspace_alloc+0x34
uvmspace_exec() at netbsd: uvmspace_exec+0x50
sys_execve() at netbsd: sys_execve+0x654
syscall_plain() at netbsd: syscall_plain+0xb4
XentSys() at netbsd: XentSys+0x5c
--- syscall(59) ---
--- user mode ---
db> sync
syncing disks ... tlp0: receive ring overrun


>How-To-Repeat:
On NetBSD/Alpha (UP1000) 2.0_RC2,

# /usr/src/build.sh -U distribution 
>Fix:
unknown
>Release-Note:
>Audit-Trail:
>Unformatted: