tech-kern archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: amd64 panics w/XEN_DOM0, not GENERIC



On Sun, Aug 24, 2008 at 04:18:49PM -0400, Todd Kover wrote:
> 
>  > On Wed, Aug 20, 2008 at 11:03:47PM +0200, Manuel Bouyer wrote:
>  >
>  > > Could you try a GENERIC+DEBUG+DIAGNOSTIC kernel ? XEN3_DOM0 has
>  > > options DEBUG options DIAGNOSTIC while GENERIC doesn't.
>  >
>  > Also please try a XEN3_DOM0 kernel with the attached patch. It should
>  > give more details on the source of the problem (the failing function
>  > should now show up in the stack trace).
> 
> I'm still working on the generic kernel (will run it for the first time
> tonight and the next few nights if it doesn't die on day one) but last
> night with:
> 
> NetBSD 4.99.72 (XEN3_DOM0) #7: Sat Aug 23 10:08:58 EDT 2008
>       
> kovert%saidin.omniscient.com@localhost:/usr/obj/current/amd64/amd/guinness/usr/src/os/NetBSD/src/sys/arch/amd64/compile/XEN3_DOM0
> total memory = 768 MB
> 
> (checkout earlier that day), with xen deamons running, I got what's below.
> 
> please let me know if there's more debugging I can stuff in to try to
> narrow it down more.
> 
> thanks,
> -Todd
> 
> 
> ---<snip>---
> panic: kernel diagnostic assertion "bp->b_error == 0" failed: file 
> "/amd/guinness/usr/src/os/NetBSD/src/sys/kern/kern_physio.c", line 219
> fatal breakpoint trap in supervisor mode
> trap type 1 code 0 rip ffffffff804b6d5d cs e030 rflags 246 cr2  7f7ffd90a700 
> cpl 6 rsp ffffa000352c5a80
> Stopped in pid 0.13 (system) at netbsd:breakpoint+0x5:  leave
> breakpoint() at netbsd:breakpoint+0x5
> panic() at netbsd:panic+0x255
> __kernassert() at netbsd:__kernassert+0x2d 
> physio_biodone() at netbsd:physio_biodone+0xa7
> scsipi_complete() at netbsd:scsipi_complete+0x17c
> scsipi_completion_thread() at netbsd:scsipi_completion_thread+0xba

OK. I guess it's calling physio_biodone() though stdone() (the stack frames
got optimized, so stdone() doesn't show up in the stack trace).
Can you run a kernel with the attached patch (either GENERIC of XEN3_DOM0) ?
It should give more details about what's going on.



-- 
Manuel Bouyer <bouyer%antioche.eu.org@localhost>
     NetBSD: 26 ans d'experience feront toujours la difference
--
Index: st.c
===================================================================
RCS file: /cvsroot/src/sys/dev/scsipi/st.c,v
retrieving revision 1.205
diff -u -p -u -r1.205 st.c
--- st.c        8 Jun 2008 18:18:34 -0000       1.205
+++ st.c        24 Aug 2008 21:26:14 -0000
@@ -1347,6 +1347,10 @@ stdone(struct scsipi_xfer *xs, int error
                                        st->blkno++;
                        }
                }
+               if (bp->bp->b_error) {
+                       printf("stdone error %d count %d resid %d\n",
+                            bp->b_error, bp->b_bcount, bp->b_resid);
+               }
 
                biodone(bp);
        }


Home | Main Index | Thread Index | Old Index