Subject: memory management fault?
To: None <port-alpha@netbsd.org>
From: Aaron J. Grier <agrier@poofygoof.com>
List: port-alpha
Date: 01/14/2007 00:16:39
running:

NetBSD arwen.poofy.goof.com 4.99.7 NetBSD 4.99.7 (GENERIC) #0: Sat Dec 30 06:27:08 PST 2006  agrier@arwen.poofy.goof.com:/usr/obj/amd/aragorn/usr/projects/NetBSD/src/sys/arch/alpha/compile/GENERIC alpha

getting the following panic, then a second identical trap while syncing disks:

CPU 0: fatal kernel trap:

CPU 0    trap entry = 0x2 (memory management fault)
CPU 0    a0         = 0xfffffe0108266000
CPU 0    a1         = 0x1
CPU 0    a2         = 0x0
CPU 0    pc         = 0xfffffc000081cbc0
CPU 0    ra         = 0xfffffc000035d3ec
CPU 0    pv         = 0x0
CPU 0    curlwp    = 0xfffffc000fcb0660
CPU 0        pid = 334, comm = nfsio

panic: trap
Begin traceback...
alpha trace requires known PC =eject=
End traceback...
syncing disks... 
CPU 0: fatal kernel trap:

CPU 0    trap entry = 0x2 (memory management fault)
CPU 0    a0         = 0xfffffe0108266000
CPU 0    a1         = 0x1
CPU 0    a2         = 0x0
CPU 0    pc         = 0xfffffc000081cbc0
CPU 0    ra         = 0xfffffc000035d3ec
CPU 0    pv         = 0x0
CPU 0    curlwp    = 0xfffffc000fcb1760
CPU 0        pid = 7, comm = aiodoned

panic: trap
Begin traceback...
alpha trace requires known PC =eject=
End traceback...

this has happened roughly once a week since I started running 4.99.7.

some programs linked with libkvm are also failing:

# mlxctl -a status
mlxctl: can't dereference kptr 0x1ffffe128
mlxctl: kvm_read: Bad address

I don't know whether that's related to the panics.

the machine is a 1000A 5/400:

NetBSD 4.99.7 (GENERIC) #0: Sat Dec 30 06:27:08 PST 2006
        agrier@arwen.poofy.goof.com:/usr/obj/amd/aragorn/usr/projects/NetBSD/src/sys/arch/alpha/compile/GENERIC 
AlphaServer 1000A 5/400, 400MHz, s/n
8192 byte page size, 1 processor.
total memory = 256 MB
(2016 KB reserved for PROM, 254 MB used by NetBSD) 
avail memory = 241 MB
mainbus0 (root)
cpu0 at mainbus0: ID 0 (primary), 21164A-2
cpu0: Architecture extensions: 1<BWX>
cia0 at mainbus0: DECchip 2117x Core Logic Chipset (ALCOR/ALCOR2), pass 3
cia0: extended capabilities: 21<DWEN,BWEN>
cia0: using BWX for PCI config access
[...etc... full dmesg on request]

-- 
  Aaron J. Grier | "Not your ordinary poofy goof." | agrier@poofygoof.com
              "silly brewer, saaz are for pils!"  --  virt