Power-Efficient Predication Techniques for Acceleration of Control Flow Execution on CGRA
Published: 2013-05
Issue: 2
Volume: 10
Page: 1-25
ISSN: 1544-3566
Container-title: ACM Transactions on Architecture and Code Optimization
Language: en
Short-container-title: ACM Trans. Archit. Code Optim.
Author:
Han Kyuseung (1),
Ahn Junwhan (1),
Choi Kiyoung (1)
Affiliation:
1. Seoul National University
Abstract
A coarse-grained reconfigurable architecture (CGRA) typically has an array of processing elements (PEs) controlled by a centralized unit. This makes it difficult to execute programs with control divergence among PEs without predication. However, conventional predication techniques hurt both performance and power consumption because of longer instruction words and unnecessary instruction fetching, decoding, and nullifying steps. This article reveals performance and power issues in predicated execution that have not yet been well addressed, and it proposes fast and power-efficient predication mechanisms. Experiments conducted through gate-level simulation show that our mechanisms improve energy-delay product by 11.9% to 23.8% on average.
Funder
Ministry of Knowledge Economy
National Research Foundation of Korea
Ministry of Education, Science and Technology
Hanyang University
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture, Information Systems, Software
Reference: 28 articles.
1. ARM NEON. 2013. The ARM® NEON™ general-purpose SIMD engine. http://www.arm.com/products/processors/technologies/neon.php.
Cited by
21 articles.