Re: Fixing pool_cache_invalidate(9), final step
On Wed, 30 May 2012 22:43:14 +0200, Lars Heidieker wrote:
Well, if I read the patch right, the transfer xcall will be issued in
pool_drain_start and waited for in pool_drain_end; therefore the other
xcall in pool_cache_invalidate, which releases per-CPU cached objects
directly into the pool, will most likely have work to do.
One question puzzles me right now: isn't it more efficient to do the
cache group swapping than to release the CPU-locally cached objects
directly to the pool?
IMHO the bottlenecks are the ctor/dtor, not the pool_cache objects
themselves. Feeding all objects back to the pool for invalidation frees
ncpu-1 CPUs from doing object destruction, at the cost of the calling
CPU having to do all the heavy work.
This depends on how ctor/dtor locking is done. "Distributing" the same
work (object destruction) to different CPUs while using mutexes in the
ctor/dtor will basically serialize the operation; true, this is
Having the CPU-cached objects pushed to the pool_cache global cache
and then draining it on the calling CPU seems quite efficient to me.
Just an idea (motivated by the fact that the split of pool_drain is
ugly IMHO, as it exposes an internal state of the draining, the xcall
arg): merge pool_drain_start and pool_drain_end, remove the xcall
from it, do the xcall from pool_cache_invalidate instead, and probably
give it an argument saying whether to wait, i.e. do it synchronously.
I will propose a patch later today including the drain_start/_end merge
with the transfer xcall.
As all CPUs might have caches to release, not much time will be wasted,
and they will contend on the pool_cache global cache, which should be
quicker than contending on the underlying pool (I haven't
There's also contention for the destruction of objects (not necessarily
heavy for a few, but there may be thousands of them cached). I would
expect this part to take longer than the contention on the underlying
pool.
The "normal" drain call from the pagedaemon might be done async, so we
don't wait for a CPU busy with interrupts in that case, which might
happen with low-priority xcalls and which is one of your concerns, if I
got it right.
Except for the async idea, I had this running with high-priority xcalls.
For the pool_cache destruction case we have to make sure that any
pool_cache_invalidate, with its pool_cache_invalidate_groups calls and
possibly the xcall that transfers the CPU-local caches to the pool_cache
global cache, has finished before pool_cache_invalidate_cpu is called.
If pool_cache_xcall is called from pool_drain, then we could rejoin
pool_drain_start and pool_drain_end...
The synchronous pool_cache_invalidate_<whatever> would fit such a
scheme; this should be covered by giving pool_cache_invalidate an arg.
If my grepping is right, pool_reclaim/pool_cache_reclaim are only used
in subr_pool.c (for _HARDKERNEL), so draining should indeed only happen
from the pageout daemon and not from interrupt context at all; so we
should (if I'm not mistaken) be able to simplify by integrating
pool_drain_start/end and calling the pool_cache_xcall from there.
Currently, with the split pool_drain_start/end, the time the xcall takes
might on SMP be used to drain the buffer pool; that is the only
difference that comes to my mind quickly.
I came to the same conclusion while looking at the drain_start/_end
functions, but they were outside the scope of my patch (I expected
opposition, so kept it as small as possible).
I think this comes as a package ;-)
Just some thoughts,
Will do, however I cannot make such an API change for -6, even if it
does not appear in the documentation (I can still ask releng@ and core@,
but I highly doubt that this is worth the hassle). I can do this for
I will update both patches accordingly, but the drain start/end split
will likely remain for -6.