Subject: Re: smp crash under network load
To: Frank van der Linden <fvdl@wasabisystems.com>
From: john heasley <heas@shrubbery.net>
List: tech-smp
Date: 12/06/2002 10:53:07
Fri, Dec 06, 2002 at 10:34:11AM +0100, Frank van der Linden:
> On Thu, Dec 05, 2002 at 10:15:09PM +1030, Brett Lymn wrote:
> > 
> > Folks,
> >         I have just brought my dual PPro up in mp mode, now when I do
> > any moderately heavy network traffic the machine drops to ddb with a
> > message of:
> > 
> >  "ex0: uplistptr was 0"
> 
> That's not a fatal error, it won't drop you into DDB.. Please make sure
> that your sources are completely up to date, and if you're still seeing
> the problem, send a PR with tracebacks.

i had a number of these on my tyan S2466.  i doubled EX_NUPD

Index: sys/dev/ic/elinkxlreg.h
===================================================================
RCS file: /cvsroot/syssrc/sys/dev/ic/elinkxlreg.h,v
retrieving revision 1.11
diff -r1.11 elinkxlreg.h
217c217
< #define EX_NUPD               128
---
> #define EX_NUPD               256

and it seemed to stop, but panics in ex_intr() did not.  below is one
trace that i caught about 3 weeks ago, after which i stuck an fxp in
the box.  i dont think this SMP related.

login: kernel: page fault trap, code=0
Stopped at      ex_intr+0x1c1:  movl    %eax,         0(%edx)
db{1}> 
db{1}> trace
ex_intr(c164b000) at ex_intr+0x1c1
Xintrioapic4() at Xintrioapic4+0x69
--- interrupt ---
idle_loop+0xc:

db{1}> show reg
ds                0x10
es                0x10
fs                0x30
gs                0x10
edi         0xc164b000  end+0x12c7e64
esi         0xc1667e00  end+0x12e4c64
ebp         0xe3446f48
ebx                  0
edx                  0
ecx         0xc164cd10  end+0x12c9b74
eax         0xc164cd10  end+0x12c9b74
eip         0xc014f3b9  ex_intr+0x1c1
cs                 0x8
eflags         0x10282  gdt_desc+0xe1d6
esp         0xe3446f10
ss                0x10
ex_intr+0x1c1:  movl    %eax,         0(%edx)

db{1}> sync
syncing disks... ex0: uplistptr was 0
kernel: page fault trap, code=0
Stopped at      genfs_putpages+0x30f:   movl         0x10c(%eax),%edx