NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: misc/43182: Reboot due to system crash



The following reply was made to PR kern/43182; it has been noted by GNATS.

From: =?UTF-8?Q?Fr=C3=A9d=C3=A9ric_Fauberteau?= 
<frederic%fauberteau.org@localhost>
To: <gnats-bugs%NetBSD.org@localhost>
Cc: 
Subject: Re: misc/43182: Reboot due to system crash
Date: Fri, 23 Apr 2010 08:23:25 +0200

 On Fri, 23 Apr 2010 04:55:01 +0000 (UTC), David Holland
 <dholland-bugs%netbsd.org@localhost> wrote:
 > The following reply was made to PR misc/43182; it has been noted by
 GNATS.
 > 
 > From: David Holland <dholland-bugs%netbsd.org@localhost>
 > To: gnats-bugs%NetBSD.org@localhost
 > Cc: 
 > Subject: Re: misc/43182: Reboot due to system crash
 > Date: Fri, 23 Apr 2010 04:51:40 +0000
 > 
 >  On Mon, Apr 19, 2010 at 04:30:00PM +0000, frederic%fauberteau.org@localhost
 wrote:
 >   > I apologize for my inexperience in bug reporting (this is my first
 >   > one).
 >  
 >  You're not doing badly :-)
 >  
 >   > Mar  7 20:56:54 /netbsd: umass0: BBB bulk-out clear stall failed,
 >   > IOERROR
 >   > Mar  7 20:56:54 /netbsd: uvm_fault(0xcc6309f8, 0, 1) -> 0xe
 >   > Mar  7 20:56:54 /netbsd: fatal page fault in supervisor mode
 >   > Mar  7 20:56:54 /netbsd: trap type 6 code 0 eip c03b0ea5 cs 8 eflags
 >   > 10246 cr2 0 ilevel 0
 >  
 >  (1)
 >  
 >   > Apr  3 07:15:22 /netbsd: fatal page fault in supervisor mode
 >   > Apr  3 07:15:22 /netbsd: trap type 6 code 2 eip c046a188 cs 8 eflags
 >   > 10292 cr2 cc666c84 ilevel 2
 >  
 >  (2)
 >  
 >   > Apr  3 07:15:22 /netbsd: dumping to dev 0,1 offset 3180407
 >   > Apr  3 07:15:22 /netbsd: dump succeeded
 >   > Apr  3 07:15:22 /netbsd: 
 >   > Apr  3 07:15:22 /netbsd: 
 >   > Apr  3 07:15:22 /netbsd: sd0(umass0:0:0:0): generic HBA error
 >   > Apr  3 07:15:22 /netbsd: fatal page fault in supervisor mode
 >   > Apr  3 07:15:22 /netbsd: trap type 6 code 0 eip 0 cs 8 eflags 10246
 >   > cr2 0 ilevel 4
 >   > Apr  3 07:15:22 /netbsd: panic: trap
 >   > Apr  3 07:15:22 /netbsd: Faulted in mid-traceback; aborting...
 >  
 >  (3)
 >  
 >   > Apr 11 21:15:38 /netbsd: fatal page fault in supervisor mode
 >   > Apr 11 21:15:38 /netbsd: trap type 6 code 2 eip c046a188 cs 8 eflags
 >   > 10292 cr2 cc863c84 ilevel 2
 >  
 >  same as (2).
 >  
 >   > And the crash occurs all the day until today. Maybe a bug with the
 >   > USB mass storage ...
 >  
 >  Yes, quite likely, although (1) and (2) may actually be different
 >  problems, and I'm curious why in (3) it seems to be accessing sd0 on
 >  umass0 after doing a crashdump to wd0b. (0,1 is wd0b; ls -l /dev/wd0b.)
 >  Ordinarily by that point in crashing it shouldn't be touching anything
 >  besides the system console.
 >  
 >   > Have you some advices to help me to produce better informations to
 >   > understand this problem ?
 >  
 >  Some things that would probably be helpful to know:
 >  
 >  (a) What's umass0 attached to? Is it going through an ehci, uhci, or
 >  ohci USB controller, and what's in between? (The easiest way to answer
 >  this question is to forward a boot log from /var/run/dmesg.boot.)
 >  Also it might be useful to what your USB device calls itself, which
 >  will also be in the boot log.
 >  
 >  (b) Is there anything unusual you're doing with the USB device that
 >  might explain what's happening in case (3)? 
 >  
 >  (c) Where did crashes (1) and (2) happen? If you feel up to it, run
 >  "nm -n /netbsd | less" and find the last name before the EIP address
 >  from the crash (c03b0ea5 in case (1), c046a188 in case (2)) -- this is
 >  the name of the function it died in. In my kernel c03b0ea5 is between
 >  these:
 >     c03b0e60 T i4b_dl_release_ind
 >     c03b0f30 T i4b_dl_establish_cnf
 >  
 >  but that doesn't mean anything; it'll be different in yours. (If you
 >  can't do this, because you're using the prebuild 5.0.2 GENERIC kernel
 >  someone else can; but if you can, it saves waiting for someone else to
 >  get around to downloading that kernel and checking.)
 >  
 >  Note that crash (3) jumped to 0 and trying to look that up won't yield
 >  anything particularly interesting. :-/
 >  
 >  Unfortunately, there are a number of more-or-less known but unsolved
 >  problems with umass...
 >  
 >  -- 
 >  David A. Holland
 >  dholland%netbsd.org@localhost
 
 In case (1) c03b0ea5 is between :
   c03b0e70 T ext2fs_inactive
   c03b0fc0 T ext2fs_checkpath
 In case (2) c046a188 is between :
   c046a1d0 T kpsignal
   c046a270 T kpgsignal
 
 My USB disk was mounted is ext2fs. I has umounted it but my kernel
 continue to panic. In ddb mode, I obtained this trace :
 
 
kpsignal2(cc552cf4,cad21d04,cad21d04,cbd41a5c,cc552cf4,cad21d24,cad21d40,c0476cd9,cc552cf4,cad21d04)
 at netbsd:kpsignal2+0x5a8
 kpsignal(cc552cf4,cad21d04,0,10c,10c,cad21d04,cad21d40,0,0,0) at
 netbsd:kpsignal+0x7a
 timer_intr(0,ca920010,ca920030,cad20010,ca920010,0,a3d360,c16b0400,0,cad21da0)
 at netbsd:timer_intr+0x229
 softint_dispatch(ca927c80,2,0,0,0,0,cad21d90,cad21ce4,cad21d00,0) at
 netbsd:softint_dispatch+0x64
 DDB lost frame for netbsd:Xsoftintr+0x3d, trying 0xcad21d88
 Xsoftintr() at netbsd:Xsoftintr+0x3d
 --- interrupt ---
 fatal page fault in supervisor mode
 trap type 6 code 0 eip c053f8a7 cs 8 eflags 10206 cr2 3a ilevel 8
 kernel: supervisor trap page fault, code=0
 Faulted in DDB; continuing...
 
 but I don't know if it is usefull ... I don't know why, but the dump of
 the kernel fails (nothin in /var/crash except 'minfree' file).
 I have compile my kernel from sources. Now I'm waiting for a panic and I
 know that nm is my friend ;)
 
 --
 Frédéric Fauberteau
 frederic%fauberteau.org@localhost
 


Home | Main Index | Thread Index | Old Index