netbsd-users: #32287 Processes hang in "mclpl"

Subject: #32287 Processes hang in "mclpl" - feedback
To: None <netbsd-users@netbsd.org>
From: Jorgen Lundman <lundman@lundman.net>
List: netbsd-users
Date: 08/22/2006 15:43:24

NetBSD  3.0 NetBSD 3.0 (netbsd) #0: Tue Dec 27 12:49:48 JST 2005 
root@netbsd:/usr/src/netbsd-3.0/sys/arch/i386/compile/netbsd i386

I was hitting this bug quite frequently:

http://www.netbsd.org/cgi-bin/query-pr-single.pl?number=32287


1179 ?     DWs  100:55.29 ./lundftpd

  1179 root     -22    0  6408K    4K mclpl    100:55  0.00%  0.00% <lundftpd>

sysctl kern.somaxkva
kern.somaxkva = 16777216

(gdb) p nmbclusters
$1 = 32768

Taking the NIC down and up did nothing, and increasing kern.somaxkva did not
"unstick" it. Only a reboot fixed the issue.


I tried to compile -current to get the fixes mentioned, but -current is not
easy to compile from 3.0.x at the moment. It seems to want a new share/mk and
gcc v4 ("--no-pointer-sign & --no-attributes" are pulled in even though it does
check for GCC_VERSION > 4).

So instead I patched in the new versions of uvm_map.[ch] and uvm_km.c. I did
not patch the "src/sys/kern/subr_pool.c" file. Perhaps I should?


Anyway, we have managed to run twice as long as the previous record with the
new patches, so they appear to make things better.

Now when the bug would have triggered I merely get:

WARNING: mclpool limit reached; increase NMBCLUSTERS 
WARNING: mclpool limit reached; increase NMBCLUSTERS
WARNING: mclpool limit reached; increase NMBCLUSTERS

.. and a short hang.

I will raise NMBCLUSTERS to 65536 and try again.
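
For reference, a rough sketch of what the move from 32768 to 65536 clusters
costs in memory. It assumes MCLBYTES is 2048 bytes (the usual i386 default,
not checked on this machine); the helper name cluster_pool_mib is made up for
illustration.

```shell
# Assumed cluster size in bytes (MCLBYTES); 2048 is the usual i386 default.
MCLBYTES=2048

# Print the maximum memory the cluster pool can use for a given limit, in MiB.
cluster_pool_mib() {
    echo $(( $1 * MCLBYTES / 1024 / 1024 ))
}

cluster_pool_mib 32768   # current limit (nmbclusters as shown by gdb)
cluster_pool_mib 65536   # planned limit

# The kernel config line for the new limit would then be:
#   options NMBCLUSTERS=65536
```

Under that assumption the pool ceiling doubles along with the limit, so the
change only makes sense if the machine can spare the extra kernel memory.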

But apart from that, just reporting that the patches at least appear to have
lifted the need to reboot every day.

Lund



-- 
Jorgen Lundman       | <lundman@lundman.net>
Unix Administrator   | +81 (0)3 -5456-2687 ext 1017 (work)
Shibuya-ku, Tokyo    | +81 (0)90-5578-8500          (cell)
Japan                | +81 (0)3 -3375-1767          (home)