Subject: Re: More amd64 instability
To: None <current-users@netbsd.org>
From: Greg Oster <oster@cs.usask.ca>
List: current-users
Date: 11/20/2007 16:54:27
Nicolas Joly writes:
> On Tue, Nov 20, 2007 at 04:19:26PM -0600, Greg Oster wrote:
> > Paul Goyette writes:
> > > On Tue, 20 Nov 2007, Juan RP wrote:
> > > 
> > > > I see that sysmon_envsys* files in your kernel are not the newest ones,
> > > > could you please try updating? the locking in sme_events_check() was changed
> > > > recently.
> > > 
> > > Well, I did a 'cvs update' in /usr/src/sys then rebuilt my kernel.  It's 
> > > worse than before!
> > > 
> > > Right after it successfully probes the azalia0, I get
> > > 
> > > uvm_fault(0xffffffff805e8c80, 0x0, 2) -> e
> > > kernel: page fault trap, code=0
> > > Stopped in pid 0.1 (system) at netbsd:softintr_schedule+0x60:  movq %r12, 0(%rax)
> > 
> > here's what I just got:
> 
> Me too :-(
> 
> With an up to date DIAGNOSTIC+LOCKDEBUG kernel:
> 
> uvm_fault(0xffffffff80c027a0, 0x0, 2) -> e
> kernel: page fault trap, code=0
> Stopped in pid 0.1 (system) at  netbsd:softintr_schedule+0x60:  movq    %r12, 0(%rax)
> db{0}> Kernel lock error: _kernel_lock: spinout
[snip]

Here are the 'ident' diffs between a kernel that works and one that 
doesn't... 

1c1
< /netbsd:   (this one works)
---
> ./netbsd:  (this one panics)
40,41c40,41
<      $NetBSD: sysmon_envsys.c,v 1.72 2007/11/16 08:00:16 xtraeme Exp $
<      $NetBSD: sysmon_envsys_events.c,v 1.45 2007/11/16 08:00:16 xtraeme Exp $
---
>      $NetBSD: sysmon_envsys.c,v 1.74 2007/11/20 17:27:35 xtraeme Exp $
>      $NetBSD: sysmon_envsys_events.c,v 1.46 2007/11/20 17:24:32 xtraeme Exp $
167c167
<      $NetBSD: irframe_tty.c,v 1.46 2007/11/10 18:29:37 ad Exp $
---
>      $NetBSD: irframe_tty.c,v 1.47 2007/11/19 18:51:48 ad Exp $
318c318
<      $NetBSD: cy.c,v 1.53 2007/11/07 15:56:14 ad Exp $
---
>      $NetBSD: cy.c,v 1.54 2007/11/19 18:51:47 ad Exp $
328c328
<      $NetBSD: com.c,v 1.265 2007/10/19 11:59:49 ad Exp $
---
>      $NetBSD: com.c,v 1.266 2007/11/19 18:51:47 ad Exp $
391c391
<      $NetBSD: puffs_msgif.c,v 1.58 2007/11/17 18:03:15 pooka Exp $
---
>      $NetBSD: puffs_msgif.c,v 1.59 2007/11/20 11:51:01 pooka Exp $
394,395c394,395
<      $NetBSD: puffs_vfsops.c,v 1.69 2007/11/16 20:32:18 pooka Exp $
<      $NetBSD: puffs_vnops.c,v 1.117 2007/11/17 21:55:29 pooka Exp $
---
>      $NetBSD: puffs_vfsops.c,v 1.70 2007/11/20 11:51:02 pooka Exp $
>      $NetBSD: puffs_vnops.c,v 1.118 2007/11/20 11:51:02 pooka Exp $
617c617
<      $NetBSD: subr_devsw.c,v 1.13 2007/11/07 00:23:22 ad Exp $
---
>      $NetBSD: subr_devsw.c,v 1.14 2007/11/20 14:47:14 pooka Exp $
652c652
<      $NetBSD: tty.c,v 1.202 2007/11/14 01:15:31 ad Exp $
---
>      $NetBSD: tty.c,v 1.205 2007/11/20 00:45:23 ad Exp $
655c655
<      $NetBSD: tty_pty.c,v 1.102 2007/11/07 15:56:22 ad Exp $
---
>      $NetBSD: tty_pty.c,v 1.104 2007/11/19 19:47:00 ad Exp $
657c657
<      $NetBSD: tty_subr.c,v 1.30 2007/11/07 15:56:22 ad Exp $
---
>      $NetBSD: tty_subr.c,v 1.31 2007/11/19 18:51:52 ad Exp $
792c792
<      $NetBSD: machdep.c,v 1.66 2007/11/12 19:02:28 ad Exp $
---
>      $NetBSD: machdep.c,v 1.67 2007/11/19 19:02:20 ad Exp $
902c902
<      $NetBSD: cz.c,v 1.46 2007/11/12 14:20:41 ad Exp $
---
>      $NetBSD: cz.c,v 1.47 2007/11/19 18:51:49 ad Exp $
1017c1017
<      $NetBSD: wsdisplay.c,v 1.111 2007/10/18 21:08:18 joerg Exp $
---
>      $NetBSD: wsdisplay.c,v 1.112 2007/11/19 18:51:51 ad Exp $
1125c1125
<      $NetBSD: ucom.c,v 1.72 2007/11/12 14:20:41 ad Exp $
---
>      $NetBSD: ucom.c,v 1.73 2007/11/19 18:51:50 ad Exp $
1133c1133
<      $NetBSD: ucycom.c,v 1.18 2007/11/10 18:29:37 ad Exp $
---
>      $NetBSD: ucycom.c,v 1.19 2007/11/19 18:51:51 ad Exp $
1146c1146
<      $NetBSD: if_aue.c,v 1.104 2007/09/01 07:32:32 dyoung Exp $
---
>      $NetBSD: if_aue.c,v 1.105 2007/11/20 10:43:01 sborrill Exp $

The source trees are dated Nov 19 08:46 CST for the working kernel 
and Nov 20 13:17 CST for the non-working one.
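
For anyone who wants to boil a comparison like the one above down to a
short list of changed files, here is a rough Python sketch. It assumes the
'ident' output of each kernel has already been saved as text; the function
names (parse_ident, changed_files) and the two sample strings are made up
for illustration, with the sample lines copied from the listing above. It
also assumes each file name appears only once per kernel, which may not
hold for every configuration.

```python
import re

# Matches RCS keyword strings such as
# "$NetBSD: tty.c,v 1.205 2007/11/20 00:45:23 ad Exp $"
RCS_ID = re.compile(r"\$NetBSD: (\S+),v (\S+) (\S+ \S+) (\S+) \S+ \$")


def parse_ident(text):
    """Map file name -> (revision, date, author) for each $NetBSD$ string.

    Assumption: a file name occurs at most once; a later duplicate
    would silently overwrite an earlier one.
    """
    result = {}
    for match in RCS_ID.finditer(text):
        name, rev, date, author = match.groups()
        result[name] = (rev, date, author)
    return result


def changed_files(good_text, bad_text):
    """Return sorted (file, good_rev, bad_rev, bad_author) tuples for
    files present in both listings whose revisions differ."""
    good = parse_ident(good_text)
    bad = parse_ident(bad_text)
    return sorted(
        (name, good[name][0], bad[name][0], bad[name][2])
        for name in good.keys() & bad.keys()
        if good[name][0] != bad[name][0]
    )


# Sample input: a few lines taken from the two listings above.
GOOD_SAMPLE = """
     $NetBSD: tty.c,v 1.202 2007/11/14 01:15:31 ad Exp $
     $NetBSD: com.c,v 1.265 2007/10/19 11:59:49 ad Exp $
"""
BAD_SAMPLE = """
     $NetBSD: tty.c,v 1.205 2007/11/20 00:45:23 ad Exp $
     $NetBSD: com.c,v 1.266 2007/11/19 18:51:47 ad Exp $
"""

if __name__ == "__main__":
    for name, old, new, author in changed_files(GOOD_SAMPLE, BAD_SAMPLE):
        print(f"{name}: {old} -> {new} ({author})")
```

Sorting or grouping the result by author or commit date would be one way
to narrow down which batch of commits to back out first.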


Later...

Greg Oster