Current-Users archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: high cpu load with tcpdump



On Mon, Feb 29, 2016 at 09:11:44PM +0100, 6bone%6bone.informatik.uni-leipzig.de@localhost wrote:
> On Mon, 29 Feb 2016, Christos Zoulas wrote:
> 
> >| Hello,
> >|
> >| the problem occurs only on one of my servers. I tried to find the
> >| difference. It is the bind9 (bind-9.10.3pl3). If I stop the bind9, tcpdump
> >| works without problems. When I restart the bind9, the CPU load goes back
> >| to 100%.
> >|
> >| Is it a problem of the kernel, tcpdump or bind9?
> >
> >Can you ktrace the bind? Perhaps it is waking up tcpdump spuriously.
> >That would indicate a kernel problem.

There seems to be a trend

>   2362      3 named    1456773618.152029244 RET   recvmsg -1 errno 35 Resource temporarily unavailable
>   2362     11 named    1456773618.152035390 GIO   fd 5 read 8 bytes ",\^B\0\0\M-{\M^?\M^?\M^?"
>   2362     11 named    1456773618.152041396 RET   read 8
>   2362      6 named    1456773618.152040698 CALL  recvmsg(0x260,0x7f7ff15fb370,0)
>   2362      9 named    1456773618.152033854 CALL  recvmsg(0x262,0x7f7ff09f8370,0)
>   2362     11 named    1456773618.152050266 CALL  __kevent50(8,0x7f7ff01f6f20,1,0,0,0)
>   2362      9 named    1456773618.152061650 MISC  msghdr: [name=0x7f7fef50f328, namelen=128, iov=0x7f7ff09f83a0, iovlen=1, control=0x7f7ff3d98fa0, controllen=96, flags=4000000]
>   2362      6 named    1456773618.152051663 MISC  msghdr: [name=0x7f7fef517088, namelen=128, iov=0x7f7ff15fb3a0, iovlen=1, control=0x7f7fec6ca1e0, controllen=96, flags=4000000]
>   2362      3 named    1456773618.152041047 CALL  write(7,0x7f7ff21fe3e0,8)
>   2362     11 named    1456773618.152075688 RET   __kevent50 -1 errno 2 No such file or directory
>   2362      9 named    1456773618.152081694 RET   recvmsg -1 errno 35 Resource temporarily unavailable
>   2362      6 named    1456773618.152085116 RET   recvmsg -1 errno 35 Resource temporarily unavailable

recvmsg() returns loads of errno 35 Resource temporarily unavailabel aka EAGAIN.

In the Citrix client thread

  http://mail-index.netbsd.org/netbsd-users/2016/02/03/msg017788.html

recvmsg() returns loads of EAGAIN, then a timer in linux_select1 terminates
=> EINTR/SEGV


Today we saw an instance of dansguardian core dump with a SEGV and a
backtrace says it was in recv(). This is happening repeatedly
(NetBSD7/amd64 stable), but unpredicably - all is well for days, then
a flurry of coredumps (restarted by watchdog). So not reproducible at
will, or at least no obvious conditions to set it off.

Really a coincidence that 3 different breakage scenarios all seem to involve
recv() returning EAGAIN?

Puzzled...

Cheers,

Patrick


Home | Main Index | Thread Index | Old Index