port-alpha: Sleazy timing code

Subject: Sleazy timing code
To: None <port-alpha@netbsd.org, port-i386@netbsd.org>
From: Hal Murray <murray@pa.dec.com>
List: port-alpha
Date: 05/30/2000 03:25:05
I have some sleazy code that I use to measure CPU usage.  The idea 
is to run a low priority job that eats up all the otherwise unused 
CPU cycles.  The ones it doesn't get were obviously "used" by something.  

  The info that top and friends show is too inaccurate for my interests.  
  They get sampled on clock interrupts and that can easily get into 
  lock step with network traffic. 

The code I use looks like this:

  for ( ; ; ) {
    for (j = 0; j < xxx; j++) {}
    counter[cpu].ticks++;
    if (quit) break;
  }

xxx is adjusted (by trial and error) so that the counter gets bumped 
often enough to be accurate and infrequently enough so that the time 
spent bumping the counter doesn't use too many cycles. 
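
For reference, here is a minimal compilable sketch of that loop. It is 
an illustration, not the exact code in use: NCPU, struct cpu_counter, 
and soak() are hypothetical names, and the volatile qualifiers are an 
assumption added so that an optimizing compiler keeps the empty inner 
loop. Making j volatile forces a load and store on every iteration, 
so the cycles-per-iteration figure would have to be measured again 
for this variant. 

```c
#include <signal.h>

#define NCPU 1                      /* hypothetical: one counter slot per CPU */

struct cpu_counter {
    volatile unsigned long ticks;   /* bumped once per outer pass */
};

struct cpu_counter counter[NCPU];
volatile sig_atomic_t quit;         /* set from elsewhere to stop the loop */

/* Burn otherwise idle cycles on one CPU; xxx is the tuned inner count. */
void soak(int cpu, unsigned long xxx)
{
    volatile unsigned long j;       /* volatile so the empty loop survives -O2 */

    for (;;) {
        for (j = 0; j < xxx; j++) {
            /* empty on purpose: this is the calibrated delay */
        }
        counter[cpu].ticks++;
        if (quit)
            break;
    }
}
```

As in the original fragment, quit is only checked after the counter 
is bumped, so even a loop that is told to stop immediately records 
one tick. 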

The key is to work out a calibration factor.  For that, I need two 
things.  One is the speed of the CPU.  The other is the number of 
cycles per inner loop iteration. 
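
The arithmetic behind that calibration can be sketched as follows. 
This is an assumed formulation, not the actual code: the function and 
parameter names are hypothetical, and it ignores the cycles spent 
bumping the counter on the assumption that xxx is large enough to 
make them negligible. 

```c
#include <assert.h>
#include <stdint.h>

/*
 * Fraction of CPU time used by everything except the soaker loop.
 *
 *   idle cycles  = ticks * inner_iters * cycles_per_iter
 *   total cycles = cpu_hz * elapsed_sec
 *   used         = 1 - idle / total
 */
double cpu_used_fraction(uint64_t ticks, uint64_t inner_iters,
                         double cycles_per_iter, double cpu_hz,
                         double elapsed_sec)
{
    assert(cpu_hz > 0.0 && elapsed_sec > 0.0);

    double idle_cycles = (double)ticks * (double)inner_iters * cycles_per_iter;
    double total_cycles = cpu_hz * elapsed_sec;
    double idle = idle_cycles / total_cycles;

    if (idle > 1.0)     /* calibration error can push this slightly over 1 */
        idle = 1.0;
    return 1.0 - idle;
}
```

For example, 50000 ticks of a 1000-iteration inner loop at 5 cycles 
per iteration is 250 million idle cycles. On a 500 MHz CPU over one 
second that is half the machine, so the other half was used. An error 
of 1 in a cycles-per-iteration value of 5 shifts the idle figure by 
20%, which is why the value has to stay stable across rebuilds. 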

So my first question...  Is there any clean way to get the CPU speed 
on a NetBSD system?  I've looked in things like sysctl hw but haven't 
found it.

  Right now, I just #include a small chunk of per-platform code and 
  maintain that by hand.  It's ugly, but it works. 

The other problem is the cycles-per-loop.  Typical numbers on i386 
systems are 4 to 6.  It obviously depends upon the type of CPU chip.  
It also seems to vary by 1 depending upon the alignment of the code 
in memory.  I don't care what it is as long as it doesn't change 
when I recompile something. 

What I've been doing so far is to put that subroutine first in its 
module, and load that module first.  That worked for 1.4.2.  (And 
since it worked, I didn't mess with it.)

But it doesn't seem to be working on the latest snapshot, at least 
on i386 systems.  I haven't tried an Alpha yet.

Is this a quirk with the new ELF stuff?  Is it rearranging code for 
me? 

Is there any way to force the compiler/loader to put some code at 
a specified alignment?

  If I put it into a library will it get page aligned or something 
  helpful like that? 

  I don't care how fast it goes as long as it does the same thing 
  after I change some other code and recompile/load. 
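
One possible approach, assuming the compiler is GCC or something 
compatible with it, is the aligned function attribute, which sets a 
minimum alignment for the function's entry point. The function name 
spin and the value 64 are hypothetical choices for illustration. This 
only pins down where the function starts. The inner loop's offset 
from that start can still change if the function body itself changes. 
GCC's -falign-loops option targets loop alignment directly. 

```c
#include <stdint.h>

/* GCC extension: request that the function start on a 64-byte boundary. */
__attribute__((aligned(64)))
unsigned long spin(unsigned long xxx)
{
    volatile unsigned long j;   /* volatile keeps the empty loop from being removed */

    for (j = 0; j < xxx; j++) {
        /* calibrated delay */
    }
    return j;                   /* number of iterations actually executed */
}
```

Whether this gives a stable cycles-per-iteration count still needs to 
be checked by measurement on each CPU type. 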

Is that even the right question?  Did something I haven't thought 
about change between 1.4.2 and the latest snapshot?