Affiliation:
1. Sun Microsystems, 901 San Antonio Road, MS USUN02-203, Palo Alto CA
Abstract
Load latency contributes significantly to execution time. Because most cache accesses hit, cache-hit latency becomes an important component of expected load latency. Most modern microprocessors have base+offset addressing loads; thus effective cache-hit latency includes an addition as well as the RAM access.This paper introduces a new technique used in the UltraSPARC III microprocessor, Sum-Addressed Memory (SAM), which performs true addition using the decoder of the RAM array, with very low latency. We compare SAM with other methods for reducing the add part of load latency. These methods include sum-prediction with recovery, and bitwise indexing with duplicate-tolerance. The results demonstrate the superior performance of SAM.
Publisher
Association for Computing Machinery (ACM)
Reference12 articles.
1. Streamlining data cache access with fast address calculation
2. Shade: a fast instruction-set simulator for execution profiling
3. Evaluation of A+B=K conditions without carry propagation
4. DEC Publications Alpha Architecture Handbook Special Announcement Edition DEC Maynard M.A. 1992 DEC Publications Alpha Architecture Handbook Special Announcement Edition DEC Maynard M.A. 1992
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献