parallel computing, SMP, and threading

To: Tim & Alethea Larson <thelarsons3%cox.net@localhost>
Subject: parallel computing, SMP, and threading
From: "Erik E. Fair" <fair%netbsd.org@localhost>
Date: Fri, 30 Apr 2004 13:00:07 -0700

At 12:43 -0500 4/30/04, Tim & Alethea Larson wrote:

Yes, I thought good threading was something of a prerequisiteto SMP. What does threading do for us on a non-MP system? Can thekernel scheduler get things done more efficiently with threaded apps?


        -----

To be perfectly clear, you don't need any kind of thread support foran MP or SMP system to be useful. The utility is in having more thanone processor to pick processes off the run queue to run in parallel.Since UNIX loves to spawn processes, this wins for throughput rightaway even if any particular application doesn't run any faster thanit did on a uniprocessor system with the same speed CPU. Imagine whatan SMP does for E-mail processing on an SMTP server when the MTAspawns a new process for each SMTP client that has contacted it. EachSMTP connection is independent, and can be run in parallel. Addprocessors, speed things up (until you run into some other limit,like disk or RAM bandwidth).

Thread support isn't even required to speed up your application, ifyour application can spawn additional processes to divide the work.Take compiling a program with "make -j N" for N CPUs, for example.Make knows which parts of the building process can be done inparallel (i.e. that do not depend on each other), and which partsmust be serialized (do one before starting the other, e.g. runninglex(1) or yacc(1) to generate a ".c" file before running cc(1) tocompile). Make essentially performs data flow analysis on programcompilation:


        http://foldoc.doc.ic.ac.uk/foldoc/foldoc.cgi?query=data+flow+analysis

However, you'll note that make doesn't require shared memory for itswork - merely a shared filesystem. So, if you tell make(1) how manyprocessors you have, it will spawn as many parallel compiles, etc.,as it can within the min() of the number of CPUs (as specified by"-j") or possible parallel compiles (as specified by the structure ofthe Makefile).

If your application has a lot of shared data that needs to beaccessed quickly (i.e. faster than disk access), then threading makessense - one process, many "threads" running in that process with ashared address space. Just be careful to watch out for data integrityby using semaphores to lock shared data structures before modifyingthem. Also, depending on the application, you may find that the cachecoherency and semaphore overhead eats away at some of the potentialperformance gain, if your application shares memory "too much".

Many of these issues are discussed in detail in the book, "In Searchof Clusters" (2nd Ed.) by Gregory F. Pfister. The NetBSD Project hasa mailing list for discussing clustering for NetBSD systems:tech-cluster%netbsd.org@localhost


It's also important to remember Amdahl's Law:

        http://foldoc.doc.ic.ac.uk/foldoc/foldoc.cgi?query=Amdahl%27s+Law

I hope this clarifies things somewhat.

        Erik <fair%netbsd.org@localhost>

Prev by Date: SGE and Globus Toolkit on NetBSD
Next by Date: SGE 5.3p6 ported to NetBSD, call for beta testers
Previous by Thread: SGE and Globus Toolkit on NetBSD
Next by Thread: SGE 5.3p6 ported to NetBSD, call for beta testers
Indexes:

Home | Main Index | Thread Index | Old Index