Improving Energy Efficiency of Coarse-Grain Reconfigurable Arrays Through Modulo Schedule Compression/Decompression-Reference-Cited by-同舟云学术

Improving Energy Efficiency of Coarse-Grain Reconfigurable Arrays Through Modulo Schedule Compression/Decompression

Published:2018-04-02 Issue:1 Volume:15 Page:1-26
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Lee Hochan¹,Moghaddam Mansureh S.¹,Suh Dongkwan²,Egger Bernhard¹

Affiliation:

1. Seoul National University, Seoul, Republic of Korea

2. Samsung Electronics, Seoul, Republic of Korea

Abstract

Modulo-scheduled course-grain reconfigurable array (CGRA) processors excel at exploiting loop-level parallelism at a high performance per watt ratio. The frequent reconfiguration of the array, however, causes between 25% and 45% of the consumed chip energy to be spent on the instruction memory and fetches therefrom. This article presents a hardware/software codesign methodology for such architectures that is able to reduce both the size required to store the modulo-scheduled loops and the energy consumed by the instruction decode logic. The hardware modifications improve the spatial organization of a CGRA’s execution plan by reorganizing the configuration memory into separate partitions based on a statistical analysis of code. A compiler technique optimizes the generated code in the temporal dimension by minimizing the number of signal changes. The optimizations achieve, on average, a reduction in code size of more than 63% and in energy consumed by the instruction decode logic by 70% for a wide variety of application domains. Decompression of the compressed loops can be performed in hardware with no additional latency, rendering the presented method ideal for low-power CGRAs running at high frequencies. The presented technique is orthogonal to dictionary-based compression schemes and can be combined to achieve a further reduction in code size.

Funder

Seoul National University

National Research Foundation of Korea

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3162018

Reference49 articles.

1. Code Compression and Decompression for Coarse-Grain Reconfigurable Architectures

2. Code Compression and Decompression for Instruction Cell Based Reconfigurable Systems

3. New Protection Mechanisms for Intellectual Property in Reconfigurable Logic

4. A Coarse-Grained Array Accelerator for Software-Defined Radio Baseband Processing

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. On the Performance Effect of Loop Trace Window Size on Scheduling for Configurable Coarse Grain Loop Accelerators;2021 International Conference on Field-Programmable Technology (ICFPT);2021-12-06

3. Patch scanning displays: spatiotemporal enhancement for displays;Optics Express;2020-01-15

4. Random test program generation for verification and validation of the Samsung Reconfigurable Processor;Journal of Systems Architecture;2019-08

5. Architectures and algorithms for on-device user customization of CNNs;Integration;2019-07