Parallel Performance Project Research Paper

Research Paper

Grouping Array Layouts to Reduce Communication and Improve Locality of Parallel Program
Tien-Pao Shih and Edward S. Davidson
Proceedings of the 1994 International Conference on Parallel and Distributed Systems, pp 558-566, December 94.

Abstract

A data layout method, array grouping, is proposed to improve communication efficiency and cache utilization of parallel programs containing indirect array references or nonunit stride indexing. Conditions on where to apply this technique are specified in a series of theorems. The technique is then applied to a real finite element application. The experimental results show that communication is reduced by 15%, and data subcache misses by 40%, on 56 processors of the KSR1 parallel computer.
Back to Publication List, or Parallel Performance Project Home Page