Affiliation:
1. Center for Embedded Computer Systems, University of California, Irvine, CA, USA
Abstract
Memory accesses represent a major bottleneck in embedded systems power and performance. Traditionally, designers tried to alleviate this problem by relying on a simple cache hierarchy, or a limited use of special purpose memory modules such as stream buffers. Although real-life applications contain a large number of memory references to a diverse set of data structures, a significant percentage of all memory accesses in the application are generated from a few memory instructions that exhibit predictable, well-known access patterns; this creates an opportunity for memory customization, targeting the needs of these access patterns. We present APEX, an approach that extracts, analyzes and clusters the most active access patterns in the application, and aggressively customizes the memory architecture to match the needs of the application. Moreover, though the memory modules are important, the rate at which the memory system can produce the data for the CPU is significantly impacted by the connectivity architecture between the memory subsystem and the CPU. Thus, it is critical to consider the connectivity architecture early in the design flow, in conjunction with the memory architecture. We couple the exploration of memory modules together with their connectivity, to evaluate a wide range of cost, performance, and energy connectivity architectures. We use a heuristic to prune the design space, guiding the exploration towards the most promising designs. We present experiments on a set of large real-life benchmarks, showing significant performance improvements for varied cost and power characteristics, allowing the designer to evaluate customized memory and connectivity configurations for embedded systems.
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference37 articles.
1. ARM AMBA Bus Specification. http://www.arm.com/armwww.ns4/html/AMBA?OpenDocument. ARM AMBA Bus Specification. http://www.arm.com/armwww.ns4/html/AMBA?OpenDocument.
2. Bakshi S. and Gajski D. 1995. A memory selection algorithm for high-performance pipelines. In EURO-DAC. Bakshi S. and Gajski D. 1995. A memory selection algorithm for high-performance pipelines. In EURO-DAC.
3. Catthoor F. Wuytack S. De Greef E. Balasa F. Nachtergaele L. and Vandecappelle A. 1998. Custom Memory Management Methodology. Kluwer. Catthoor F. Wuytack S. De Greef E. Balasa F. Nachtergaele L. and Vandecappelle A. 1998. Custom Memory Management Methodology. Kluwer.
4. Chen H.-M. Zhou H. Young F. Wong D. Yang H. and Sherwani N. 1999. Integrated floorplanning and interconnect planning. In ICCAD. Chen H.-M. Zhou H. Young F. Wong D. Yang H. and Sherwani N. 1999. Integrated floorplanning and interconnect planning. In ICCAD.
5. Chou P. Ortega R. and Borriello G. 1995. Interface co-synthesis techniques for embedded systems. In ICCAD. Chou P. Ortega R. and Borriello G. 1995. Interface co-synthesis techniques for embedded systems. In ICCAD.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. EXPRESSION;Processor Description Languages;2008