Parallel Performance Project Research Paper
Research Paper
-
Improving Performance of L1 Cache With an Associated Buffer
Vijayalakshmi Srinivasan and Edward S. Davidson
Technical Report CSE-TR-361-98, University of Michigan, March, 1998
Abstract
-
Memory access latencies are much larger than processor cycle times, and the
trend has been for this gap to increase over time. Cache performance
becomes critical in bridging this gap. However, since it is difficult to
make a cache both large and fast, cache misses are expected
to continue to have a significant performance impact.
Victim caching, proposed by Jouppi is an approach
to decrease the miss ratio of direct-mapped caches without affecting
their access time. NTS caching, proposed by Rivers is
a multilateral cache design scheme that improves performance
of first-level(L1) caches based on the temporal locality of the reference
pattern. We propose an improvement of these schemes,
which we call NT-victim caching. Taking the lead from NTS design
we have a bilateral L1 cache, having a main cache (cache A) and a
small fully associative buffer (cache B). Cache B is similar to a
victim buffer, and holds the victim block replaced
by a miss. A cache block is temporal if after it is brought into cache,
some word in that block is accessed more than once before the block is replaced.
Unlike Victim caches a block that is hit in cache B is swapped with
a block in cache A only if it is temporal, and in most of our replacement
strategies temporal blocks are less likely to be selected for
replacement than non-temporal blocks. Every cache block
loaded into L1 cache is monitored for temporal behavior by a hardware
detection unit. Our results show that this design reduces the number of
swaps between cache A and cache B, relative to the Victim cache, yet
gives a comparable miss ratio.
Back to Publication List, or
Parallel Performance Project Home Page