Affiliation:
1. Stanford University
2. University of California at Berkeley
Abstract
Despite their superior performance for multimedia applications, vector processors have three limitations that hinder their widespread acceptance. First, the complexity and size of the centralized vector register file limits the number of functional units. Second, precise exceptions for vector instructions are difficult to implement. Third, vector processors require an expensive on-chip memory system that supports high bandwidth at low access latency.This paper introduces CODE, a scalable vector microarchitecture that addresses these three shortcomings. It is designed around a clustered vector register file and uses a separate network for operand transfers across functional units. With extensive use of decoupling, it can hide the latency of communication across functional units and provides 26% performance improvement over a centralized organization. CODE scales efficiently to 8 functional units without requiring wide instruction issue capabilities. A renaming table makes the clustered register file transparent at the instruction set level. Renaming also enables precise exceptions for vector instructions at a performance loss of less than 5%. Finally, decoupling allows CODE to tolerate large increases in memory latency at sub-linear performance degradation without using on-chip caches. Thus, CODE can use economical, off-chip, memory systems.
Publisher
Association for Computing Machinery (ACM)
Reference25 articles.
1. D. Alpert. Scalable MicroSuperComputers. Microprocessor Report pages 1--6 March 2003. D. Alpert. Scalable MicroSuperComputers. Microprocessor Report pages 1--6 March 2003.
2. K. Asanović. Vector Microprocessors. PhD thesis Computer Science Division University of California at Berkeley 1998. K. Asanović. Vector Microprocessors. PhD thesis Computer Science Division University of California at Berkeley 1998.
3. Direct RAMbus technology: the new main memory standard
4. EEMBC Benchmarks. http://www.eembc.org. EEMBC Benchmarks. http://www.eembc.org.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. MANIC;Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture;2019-10-12
2. Memory Controller for Vector Processor;Journal of Signal Processing Systems;2016-12-28
3. Multicore-based vector coprocessor sharing for performance and energy gains;ACM Transactions on Embedded Computing Systems;2013-09
4. Versatile design of shared vector coprocessors for multicores;Microprocessors and Microsystems;2012-10