David Mark's suggestion

Jim Westervelt westerve at marla.urban.uiuc.edu
Wed Jan 22 10:25:04 EST 1992


Providing an automated capability to estimate execution time of GRASS commands
has been stuck in the rumination-over-beer stage.  We find the following 
hurdles to be problematic - some more so than others:

1) Working under UNIX, GRASS has the wonderful advantage of utilizing virtual
   memory.  A program running entirely in memory without system page swapping
   can run many times faster than the same program running while page swapping 
   is taking place.  Whether or not the system will be busy swapping depends
   on the entire system load.  With multi-user multi-tasking systems it is
   very difficult to estimate up front wall-clock execution time.
2) GRASS is a multi-platform system.  It runs on a wide range of platforms
   (Cray - workstations - PC's) each of which can have anywhere from 1 to
   hundreds of megs of main memory and can have little to gigabytes of swap
   space.  Estimating wall-clock time on different machines is problematic.
3) Variable load.  Assuming that a quick estimation can be made of memory
   and CPU requirements of a GIS operation up-front and that current system
   characteristics can be estimated accurately (CPU speed, available real
   memory, available virtual memory, amount of allocated memory actually
   active (being paged in and out), and speed of associated peripherals (e.g.
   plotters), it is still difficult to anticipate system use.  Some program
   running continuously (at low load) could record a history of system loads.
   At run-time, the GIS program could compare CPU needs with the time-of-day,
   day-of-the-week, and holiday schedule with the historical information to
   make a best guess of anticipated clock-time.  Of course, unanticipated high
   use will throw off the estimate.

Currently our approach has been to leave time estimation up to the user. After
a very few repetitions of a given operation, the user can readily get a feel
for the operation of the software on the given hardware.

One approach we've had is to operate, by default, some of the more CPU
dependent operations in the background.  The user automatically gets mail
upon the completion of the task.  Duane Marble once responded to GIS-L
complaints about this procedure by suggesting that if a GIS operation takes
that long it should have its algorithm overhauled.  While always a possibility,
some operations are inherently difficult to speed up.  For example, consider
the following analyses on a 4000x4000 raster map (16M of pixels): proximity 
analysis (buffer zones) - especially on a latitude-longitude projection,
many neighborhood operations, application of an expert-system rule base to 
individual pixels, and many of the statistical operations involved in image 
processing.  Many vector operations can similarly take a very long time to
operate.   Hence, the conversation that I am continuing above.



More information about the grass-dev mailing list