Subject: Re: NMI on Compaq 1850R (was Re: cac problems with 2.0E-20040517)
To: None <current-users@NetBSD.org>
From: Chris Ross <cross+netbsd@distal.com>
List: current-users
Date: 06/13/2004 18:28:40
On Jun 12, 2004, at 04:38, Chris Ross wrote:
>   Now that that was ruled out, I tried the only other thing I
> could think to try.  I removed the upper 512MB of memory.  (I
> had 4 256MB DIMMs.  Now have only 2 256MB DIMMs.)  It will reliably
> install now, and doesn't seem to have any problem running.
>
>   However, I know this system was running just fine with BSD/OS
> on it, with all 1 GB of memory.  And, I know the BSD/OS kernel
> was configured to panic on parity-errors, so I wouldn't think
> that hardware-detected parity-errors are the problem.

   Okay.  It seems to run just fine with the 2 256MB DIMMs in it, but
if I put back either of the remaining ones, it will get an NMI (from
memcpy, called out of the uvm_ code).  Similar if I but them both
back.

   The trace I see right now (running with 1GB of memory) is:

memcpy(1f891000,20012000,0,6,c07cba80) at netbsd:memcpy+0x15:	repe 
movsl	(%esi),%es:(%edi)
uvm_fault(cbe5ee9c,bfbfd000,0,2,0) at netbsd:uvm_fault+0x523
trap() at netbsd:trap+0x38d
--- trap (number 6) ---
0x480e2710

   The x86 assembly I saw for the others looked the same.  Possibly
the same place in memcpy(), definately in those cases called from
uvm_something, but I don't think they too were uvm_fault().

   Does anyone have an x86 machine running GENERIC, with more
than 512 MB of memory?  I'd like to know if someone else could load
up the 20040517-2.0E snapshot, and have this *not* happen.

    I'm running the kernel I got in the binary sets, KERN-GENERIC.

                                   - Chris