Affiliation:
1. Washington University, St. Louis, MO, USA
Abstract
The disparity in performance between processors and main memories has led computer architects to incorporate large cache hierarchies in modern computers. Because these cache hierarchies are designed to be general-purpose, they may not provide the best possible performance for a given application. In this paper, we determine a memory subsystem well suited for a given application and main memory by discovering a memory subsystem comprised of caches,scratchpads, and other components that are combined to provide better performance. We draw motivation from the superoptimization of instruction sequences, which successfully finds unusually clever instruction sequences for programs. Targeting both ASIC and FPGA devices, we show that it is possible to discover unusual memory subsystems that provide performance improvements over a typical memory subsystem.
Funder
Division of Computer and Network Systems
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Reference35 articles.
1. Leap scratchpads
2. A dynamically tunable memory hierarchy
3. Cache-conscious data placement
4. Compiler optimizations for improving data locality
5. J. Chang P. Ranganathan D. A. Roberts M. A. Shah and J. Sontag. Data storage apparatus and methods Mar. 2012. US Patent App. 2012/0131278. J. Chang P. Ranganathan D. A. Roberts M. A. Shah and J. Sontag. Data storage apparatus and methods Mar. 2012. US Patent App. 2012/0131278.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献