Theodore B. Tabe
Quentin F. Stout
Computer Science and Engineering, University of Michigan
Abstract: Statistical analysis of traces taken from the NAS Parallel Benchmarks reveals much about the network traffic that can be expected from scientific applications run on distributed-memory parallel computers. For instance, such applications use relatively few communication library functions, their message lengths vary widely, they send many more short messages than long ones, and within a single application the messages tend to follow relatively simple patterns. Hardware and software designers can use such information to optimize their systems for the highest possible performance.
We analyze both the static and dynamic aspects of the MPI procedure calls in these benchmarks, characterizing the routines that are utilized and the message lengths involved. We also provide the communication kernels of these benchmarks, showing that they have loop-based structures that are easily characterized.
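As a rough illustration of the dynamic side of such an analysis, the sketch below tallies how often each MPI routine appears in a trace and groups message lengths into power-of-two buckets. It is a minimal sketch under stated assumptions, not the tooling used for the paper: the record format, the `TRACE` data, and the `summarize` helper are all hypothetical, and the numbers are invented for illustration.

```python
from collections import Counter

# Hypothetical trace records: (MPI routine name, message length in bytes).
# These values are invented for illustration and are not taken from the paper.
TRACE = [
    ("MPI_Send", 8),
    ("MPI_Recv", 8),
    ("MPI_Send", 8),
    ("MPI_Recv", 8),
    ("MPI_Send", 4096),
    ("MPI_Recv", 4096),
    ("MPI_Allreduce", 40),
    ("MPI_Bcast", 131072),
]


def summarize(trace):
    """Return (calls per routine, messages per power-of-two length bucket)."""
    calls = Counter(name for name, _ in trace)
    # Bucket b holds lengths n with 2**(b-1) <= n < 2**b; bucket 0 holds n == 0.
    buckets = Counter(nbytes.bit_length() for _, nbytes in trace)
    return calls, buckets


calls, buckets = summarize(TRACE)

# Routines in decreasing order of use.
for name, count in calls.most_common():
    print(f"{name:15s} {count}")

# Message-length histogram, one line per occupied bucket.
for b in sorted(buckets):
    print(f"< {2 ** b:7d} bytes: {buckets[b]}")
```

Power-of-two buckets are one convenient choice when lengths span several orders of magnitude; a static analysis would instead count call sites in the benchmark source rather than calls executed.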
Keywords: parallel computing, benchmarks, trace analysis, message-passing, distributed memory parallel computer, communication patterns, MPI, NAS Parallel Benchmarks, performance analysis
Complete paper. This paper is also available as University of Michigan Computer Science and Engineering technical report CSE-TR-386-99.
Related work:
An analysis of the performance of collective communications, showing the effects of operating system jitter.
A modest explanation of parallel computing, a tutorial (Parallel Computing 101), and a list of parallel computing resources.
An overview of our work, and relevant papers in parallel computing.
Copyright © 2004-2009 Quentin F. Stout.