
Chameleon

Published:2010-04 Issue:1 Volume:7 Page:1-35
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Woo Dong Hyuk¹, Fryman Joshua B.², Knies Allan D.², Lee Hsien-Hsin S.¹

Affiliation:

1. Georgia Institute of Technology

2. Intel Corporation

Abstract

Heterogeneous multicore processors have emerged as an energy- and area-efficient architectural solution to improving performance for domain-specific applications such as those with a plethora of data-level parallelism. These processors typically contain a large number of small, compute-centric cores for acceleration while keeping one or two high-performance ILP cores on the die to guarantee single-thread performance. Although a major portion of the transistors are occupied by the acceleration cores, these resources will sit idle when running unparallelized legacy codes or the sequential part of an application. To address this underutilization issue, in this article, we introduce Chameleon, a flexible heterogeneous multicore architecture to virtualize these resources for enhancing memory performance when running sequential programs. The Chameleon architecture can dynamically virtualize the idle acceleration cores into a last-level cache, a data prefetcher, or a hybrid between these two techniques. In addition, Chameleon can operate in an adaptive mode that dynamically configures the acceleration cores between the hybrid mode and the prefetch-only mode by monitoring the effectiveness of the Chameleon cache mode. In our evaluation with SPEC2006 benchmark suite, different levels of performance improvements were achieved in different modes for different applications. In the case of the adaptive mode, Chameleon improves the performance of SPECint06 and SPECfp06 by 31% and 15%, on average. When considering only memory-intensive applications, Chameleon improves the system performance by 50% and 26% for SPECint06 and SPECfp06, respectively.

Funder

Division of Computing and Communication Foundations

Division of Computer and Network Systems

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture, Information Systems, Software

Link

https://dl.acm.org/doi/pdf/10.1145/1736065.1736068

Reference: 68 articles.

1. Exploiting Postdominance for Speculative Parallelization

2. Data prefetching by dependence graph precomputation

3. Arevalo A., Matinata R., Pandian M., Peri E., Ruby K., Thomas F., and Almond C. 2008. Programming the Cell Broadband Engine Architecture: Examples and Best Practices. IBM Redbooks, Armonk, NY.

4. Wattch

Cited by 1 article.

1. ad-heap; Proceedings of Workshop on General Purpose Processing Using GPUs; 2014-03