Re: Writing to multiple descriptors with one system call

To: Sad Clouds <cryintothebluesky%googlemail.com@localhost>
Subject: Re: Writing to multiple descriptors with one system call
From: Jean-Yves Migeon <jeanyves.migeon%free.fr@localhost>
Date: Thu, 18 Mar 2010 00:53:26 +0100

On 03/17/10 17:53, Sad Clouds wrote:

1. Accepting connections

On busy web server it is likely that the listening socket will have
multiple entries on the completed connection queue (i.e. established
state). You call accept() in a loop until you get EWOULDBLOCK and since
these are new connections, the chances are all these sockets contain
HTTP GET requests for /index.html

Instead of calling write() 50 times for all 50 new sockets, you could
just call write2v() once.

So basically, between the 1st accept(2) and the last, all the clientsare waiting for input (which they will get sequentially, when you willperform your write2v() after the _last_ accept(2)).

Which means that the 1st "accepted" filedes created will wait for the50th to be accepted. Seems inefficient for me.

Food for thought: how do you expect your write2v() call to handleblocking vs non blocking I/O? In case of a blocking one, should thewrite2v() call return anyway?

2. Sending similar requests

When the server is handling large number of connections, there is a
pretty good chance that some of those connections will request the same
data for the same popular web resources.  You have a queue of active
connections and every time you go in a loop servicing those
connections, you check your cache for valid data. Whenever you have a
cache hit, you mark it and aggregate multiple requests for the same
file into a single reply queue. Again, instead of calling write()
multiple times, you could issue a single system call to write a set of
buffers to multiple sockets.

You make one client wait for the others; this relies on an unverifiableassumption (predicting future?), and delay things. Human beings aresensible to unpleasant lag.

This might make a big difference to the overall system time and
dramatically reduce load.

If you want to achieve such "parallelism", just play with mmap(), forkand threads. The kernel will do the caching for you (if your resource iscalled sufficiently enough, I can't see how the LRU behind will discardit...), and the only induced overhead would be the context switch forthe write(2) syscall (depends on the reentrancy of the OS you use).

Should the context switch overhead become unpleasant for you: roll outyour own in-kernel server system, and syscalls will become function calls.

And pray that you don't have too many buffer overflows flying aroundyour code (or any other exploit).


Cheers :)

--
Jean-Yves Migeon
jeanyves.migeon%free.fr@localhost

Follow-Ups:
- Re: Writing to multiple descriptors with one system call
  - From: Sad Clouds

References:
- Writing to multiple descriptors with one system call
  - From: Sad Clouds
- Re: Writing to multiple descriptors with one system call
  - From: Manuel Bouyer
- Re: Writing to multiple descriptors with one system call
  - From: Sad Clouds
- Re: Writing to multiple descriptors with one system call
  - From: Eric Haszlakiewicz
- Re: Writing to multiple descriptors with one system call
  - From: Sad Clouds

Prev by Date: Re: Writing to multiple descriptors with one system call
Next by Date: Re: Writing to multiple descriptors with one system call
Previous by Thread: Re: Writing to multiple descriptors with one system call
Next by Thread: Re: Writing to multiple descriptors with one system call
Indexes:

Home | Main Index | Thread Index | Old Index