Current-Users archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: named spinning
In Message <20111007005955.34EB89711D%rebar.astron.com@localhost>,
christos%zoulas.com@localhost (Christos Zoulas)wrote:
=>On Oct 6, 8:49pm, gary%duzan.org@localhost (Gary Duzan) wrote:
=>-- Subject: Re: named spinning
=>
=>|
=>| ===================================================================
=>| wheel { /usr/src/external/bsd/bind/lib/libisc } #
/usr2/obj/external/bsd/bind/bin/named/named -f
=>| <>got: 0 of 1, ident: 5
=>| <>got: 0 of 1, ident: 5
=>| <>got: 0 of 1, ident: 5
=>| <
=>| ===================================================================
=>|
=>| So it is just staying in kevent(), eating CPU.
=>|
=>| 14090 root 34 0 86M 33M CPU/1 4:31 83.79% 83.79% named
=>|
=>| Here is the fstat output:
=>|
=>| ===================================================================
=>| wheel { /usr/src/external/bsd/bind/dist/lib/isc/unix } # fstat -p `pgrep
named`
=>| USER CMD PID FD MOUNT INUM MODE SZ|DV R/W
=>| root named 14090 wd / 13992411 drwxr-xr-x 512 r
=>| root named 14090 0 /dev/pts 3 crw--w---- ttyp0 rw
=>| root named 14090 1 /dev/pts 3 crw--w---- ttyp0 rw
=>| root named 14090 2 /dev/pts 3 crw--w---- ttyp0 rw
=>| root named 14090 3* unix dgram <-> ffff80000e244980
=>| root named 14090 4 / 13647756 crw-rw-rw- 3658,321633 rw
=>| root named 14090 5* pipe 0xffff8000b8f0b7e0 <-
0xffff8000948e12b8 rn
=>| root named 14090 6* pipe 0xffff8000948e12b8 ->
0xffff8000b8f0b7e0 w
=>| root named 14090 4* kqueue pending 0
=>| root named 14090 5* crypto 0xffff80000dc36f18
=>| root named 14090 9 / 13648392 cr--r--r-- 1438,320325 r
=>| [ and the port 53 sockets ]
=>| ===================================================================
=>|
=>| Any more ideas what to look for? I think some of the other things
=>| I was seeing were artifacts of being in the debugger, which the
=>| printfs remove. I could try building a debugging libc on another
=>| box with the same current snapshot, but it wouldn't be my first
=>| choice.
=>
=>Yes 5 is crypto are you using DNSSEC? I don't see crypto on mine...
=>Now I suspect kevent is misbehaving with /dev/crypto. Let me cook
=>a kernel with crypto. In the meantime, can you remove /dev/crypto,
=>restart named, and see if it works?
Actually, note that 5 is both crypto and pipe here, which is
odd. An fstat bug, maybe; lsof output seems more sane. With
/dev/crypto renamed I get the same thing as before, with an fstat
of:
===================================================================
USER CMD PID FD MOUNT INUM MODE SZ|DV R/W
root named 18674 wd / 13992411 drwxr-xr-x 512 r
root named 18674 0 /dev/pts 9 crw------- ttyp3 rw
root named 18674 1 /dev/pts 9 crw------- ttyp3 rw
root named 18674 2 /dev/pts 9 crw------- ttyp3 rw
root named 18674 3* unix dgram <-> ffff80000e244980
root named 18674 4 / 13647756 crw-rw-rw- 3674,321743 rw
root named 18674 5* pipe 0xffff8000b8f0b7e0 <- 0xffff8000577cda18 rn
root named 18674 6* pipe 0xffff8000577cda18 -> 0xffff8000b8f0b7e0 w
root named 18674 4* kqueue pending 0
root named 18674 8 / 13648392 cr--r--r-- 1438,320325 r
===================================================================
and an lsof of:
===================================================================
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
named 18674 root 0u VCHR 5,3 0t657 9 /dev/pts/3
named 18674 root 1u VCHR 5,3 0t657 9 /dev/pts/3
named 18674 root 2u VCHR 5,3 0t657 9 /dev/pts/3
named 18674 root 3u unix 0xffff80000e39eb70 0t0
->0xffff80000e09eb68
named 18674 root 4u VCHR 2,2 0t0 13647756 /dev/null
named 18674 root 5r PIPE 0xffff8000b8f0b7e0 16384
->0xffff8000577cda18
named 18674 root 6w PIPE 0xffff8000577cda18 0
->0xffff8000b8f0b7e0
named 18674 root 7u KQUEUE 0xffff800057944380
named 18674 root 8r VCHR 46,0 0t0 13648392
/dev/random
===================================================================
I'm still getting an ident of 5, so it is definitely talking to
the pipe. No other process has the other end of the pipe, so it is
only talking to itself. And when it goes loopy it clearly isn't
talking to anyone, since there is no read/write activity in the
ktruss output.
===================================================================
wheel { ~ } # ktruss -p `pgrep named` | egrep -v
'___lwp_park50|__gettimeofday50|_lwp_unpark'
18674 6 named emul(netbsd)
===================================================================
I do get lots of other activity if I run it under ktruss from
the beginning, but nothing with the above filter once it settles
down.
Gary Duzan
Home |
Main Index |
Thread Index |
Old Index