Parallel Performance Project Research Paper
Research Paper
-
Grouping Array Layouts to Reduce Communication and Improve
Locality of Parallel Program
Tien-Pao Shih and Edward S. Davidson
Proceedings of the 1994 International Conference on Parallel and
Distributed Systems, pp 558-566, December 94.
Abstract
-
A data layout method, array grouping, is proposed to improve communication
efficiency and cache utilization of parallel programs containing indirect
array references or nonunit stride indexing. Conditions on where to apply
this technique are specified in a series of theorems. The technique is then
applied to a real finite element application. The experimental results show
that communication is reduced by 15%, and data subcache misses by 40%, on
56 processors of the KSR1 parallel computer.
Back to Publication List, or
Parallel Performance Project Home Page