NetBSD-Bugs archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: kern/53005: apache httpd can hang the system



The following reply was made to PR kern/53005; it has been noted by GNATS.

From: clare%csel.org@localhost
To: gnats-bugs%NetBSD.org@localhost
Cc: Manuel Bouyer <bouyer%antioche.eu.org@localhost>, kern-bug-people%netbsd.org@localhost,
 gnats-admin%netbsd.org@localhost, netbsd-bugs%netbsd.org@localhost
Subject: Re: kern/53005: apache httpd can hang the system
Date: Wed, 14 Feb 2018 00:41:30 +0900

 On Mon, 12 Feb 2018 19:20:01 +0000 (UTC)
 Manuel Bouyer <bouyer%antioche.eu.org@localhost> wrote:
 
 > From: Manuel Bouyer <bouyer%antioche.eu.org@localhost>
 > To: clare%csel.org@localhost
 > Cc: gnats-bugs%NetBSD.org@localhost, kern-bug-people%netbsd.org@localhost, gnats-admin%netbsd.org@localhost,
 >         netbsd-bugs%netbsd.org@localhost
 > Subject: Re: kern/53005: apache httpd can hang the system
 > Date: Mon, 12 Feb 2018 20:18:06 +0100
 > 
 >  On Mon, Feb 12, 2018 at 03:40:38PM +0900, clare%csel.org@localhost wrote:
 >  > >  When it hangs, can you enter ddb and get a backtrace of the apache process ?
 >  > 
 >  > db{0}> bt
 >  > breakpoint() at netbsd:breakpoint+0x5
 >  > wskbd_translate() at netbsd:wskbd_translate+0xbb4
 >  > wskbd_input() at netbsd:wskbd_input+0x5b
 >  > pckbd_input() at netbsd:pckbd_input+0x6b
 >  > pckbcintr() at netbsd:pckbcintr+0x8d
 >  > intr_bloglock_wrapper() at netbsd:intr_biglock_wrapper+0x1d
 >  > Xintr_ioapic_edge4() at netbsd:Xintr_ioapic_edge4+0xf1
 >  > --- interrupt ---
 >  > exit_lwps() at netbsd:exit_lwps+0x58
 >  > exit1() at netbsd:exit1+0x68
 >  > sys_exit() at netbsd:sys_eit+0x3d
 >  > syscall() at netbsd:syscall+0x1d8
 >  > --- syscall (number 1) ---
 >  > 
 >  > db{0}> trace/t 50
 >  > trace: pid 80 lid 40 at 0xffff80002a503db0
 >  > ?() at ffffe4001fa3b5c0
 >  > lwp_exit_switchaway() at netbsd:lwp_exit_switchaway+0x1ac
 >  > Bad frame pointer: 0xffffe4001a1c8300
 >  
 >  Do you want to backtrace pid 50 (decimal) or 80 (decimal) ?
 >  if you want to trace pid 50, you have to use:
 >  trace/t 0t50
 >  (numbers are interpreted as hex by default).
 
 the PID was 80 as of showing ps command in ddb.
 i thounght 80 was shown in decimal.
 
 screenshot is here:
 https://www.csel.org/netbsd/pr/53005/ddb-2018-2-12.png
 
 
 >  Also it would be interesting to see which pid was interupted by entering ddb.
 >  I wonder if it's stuck looping in exit_lwps()
 
 I cannot reproduce the problem on the bare metal multiprocessor local
 machine, currently.
 
 when I enabled both LOCKDEBUG and DEBUG options, then problem was gone.
 
 when I enabled DEBUG and without LOCKDEBUG options, then problem was gone.
 
 when I enabled LOCKDEBUG and without DEBUG options, then problem was appeared,
 screenshots are in https://www.csel.org/netbsd/pr/53005/lockdebug/
 
 
 -- 
 Shinichi Doyashiki <clare%csel.org@localhost>
 


Home | Main Index | Thread Index | Old Index