Affiliation:
1. Carnegie Mellon University, Pittsburgh, PA
Abstract
This paper describes a computer architecture,
Spatial Computation
(SC), which is based on the translation of high-level language programs directly into hardware structures. SC program implementations are completely distributed, with no centralized control. SC circuits are optimized for
wires
at the expense of computation units.In this paper we investigate a particular implementation of SC: ASH (Application-Specific Hardware). Under the assumption that computation is cheaper than communication, ASH replicates computation units to simplify interconnect, building a system which uses very simple, completely dedicated communication channels. As a consequence, communication on the datapath never requires arbitration; the only arbitration required is for accessing memory. ASH relies on very simple hardware primitives, using no associative structures, no multiported register files, no scheduling logic, no broadcast, and no clocks. As a consequence, ASH hardware is fast and extremely power efficient.In this work we demonstrate three features of ASH: (1) that such architectures can be built by automatic compilation of C programs; (2) that distributed computation is in some respects fundamentally different from monolithic superscalar processors; and (3) that ASIC implementations of ASH use three orders of magnitude less energy compared to high-end superscalar processors, while being on average only 33% slower in performance (3.5x worst-case).
Publisher
Association for Computing Machinery (ACM)
Reference115 articles.
1. International technology roadmap for semiconductors (ITRS). http://public.itrs.net/Files/1999 SIA Roadmap/Design.pdf 1999.]] International technology roadmap for semiconductors (ITRS). http://public.itrs.net/Files/1999 SIA Roadmap/Design.pdf 1999.]]
2. Clock rate versus IPC
3. Software pipelining
4. Speed and power scaling of SRAM's
5. SSA is functional programming
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献