AHA: An Agile Approach to the Design of Coarse-Grained Reconfigurable Accelerators and Compilers-Reference-Cited by-同舟云学术

AHA: An Agile Approach to the Design of Coarse-Grained Reconfigurable Accelerators and Compilers

Published:2023-01-24 Issue:2 Volume:22 Page:1-34
ISSN:1539-9087
Container-title:ACM Transactions on Embedded Computing Systems
language:en
Short-container-title:ACM Trans. Embed. Comput. Syst.

Author:

Koul Kalhan¹^ORCID,Melchert Jackson¹^ORCID,Sreedhar Kavya¹^ORCID,Truong Leonard¹^ORCID,Nyengele Gedeon¹^ORCID,Zhang Keyi¹^ORCID,Liu Qiaoyi¹^ORCID,Setter Jeff¹^ORCID,Chen Po-Han¹^ORCID,Mei Yuchen¹^ORCID,Strange Maxwell¹^ORCID,Daly Ross¹^ORCID,Donovick Caleb¹^ORCID,Carsello Alex¹^ORCID,Kong Taeyoung¹^ORCID,Feng Kathleen¹^ORCID,Huff Dillon¹^ORCID,Nayak Ankita¹^ORCID,Setaluri Rajsekhar¹^ORCID,Thomas James¹^ORCID,Bhagdikar Nikhil¹^ORCID,Durst David¹^ORCID,Myers Zachary¹^ORCID,Tsiskaridze Nestan¹^ORCID,Richardson Stephen¹^ORCID,Bahr Rick¹^ORCID,Fatahalian Kayvon¹^ORCID,Hanrahan Pat¹^ORCID,Barrett Clark¹^ORCID,Horowitz Mark¹^ORCID,Torng Christopher¹^ORCID,Kjolstad Fredrik¹^ORCID,Raina Priyanka¹^ORCID

Affiliation:

1. Stanford University, Stanford, California, USA

Abstract

With the slowing of Moore’s law, computer architects have turned to domain-specific hardware specialization to continue improving the performance and efficiency of computing systems. However, specialization typically entails significant modifications to the software stack to properly leverage the updated hardware. The lack of a structured approach for updating the compiler and the accelerator in tandem has impeded many attempts to systematize this procedure. We propose a new approach to enable flexible and evolvable domain-specific hardware specialization based on coarse-grained reconfigurable arrays (CGRAs). Our agile methodology employs a combination of new programming languages and formal methods to automatically generate the accelerator hardware and its compiler from a single source of truth. This enables the creation of design-space exploration frameworks that automatically generate accelerator architectures that approach the efficiencies of hand-designed accelerators, with a significantly lower design effort for both hardware and compiler generation. Our current system accelerates dense linear algebra applications but is modular and can be extended to support other domains. Our methodology has the potential to significantly improve the productivity of hardware-software engineering teams and enable quicker customization and deployment of complex accelerator-rich computing systems.

Funder

DSSoC DARPA

Stanford AHA Agile Hardware Center

Affiliates Program, Intel’s Science and Technology Center

Stanford SystemX Alliance

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3534933

Reference61 articles.

1. Chipyard: Integrated Design, Simulation, and Implementation Framework for Custom SoCs

2. Creating an Agile Hardware Design Flow

3. Clark Barrett Pascal Fontaine and Cesare Tinelli. 2016. The Satisfiability Modulo Theories Library (SMT-LIB). www.SMT-LIB.org.

4. LegUp

5. Amber: A 367 GOPS, 538 GOPS/W 16nm SoC with a Coarse-Grained Reconfigurable Array for Flexible Acceleration of Dense Linear Algebra

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. HierCGRA: A Novel Framework for Large-scale CGRA with Hierarchical Modeling and Automated Design Space Exploration;ACM Transactions on Reconfigurable Technology and Systems;2024-05-10

2. R-Blocks: an Energy-Efficient, Flexible, and Programmable CGRA;ACM Transactions on Reconfigurable Technology and Systems;2024-05-10

3. FDRA: A Framework for a Dynamically Reconfigurable Accelerator Supporting Multi-Level Parallelism;ACM Transactions on Reconfigurable Technology and Systems;2024-01-27

4. A CGRA Front-end Compiler Enabling Extraction of General Control and Dedicated Operators;2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC);2024-01-22

5. ImageMap: Enabling Efficient Mapping from Image Processing DSL to CGRA;Lecture Notes in Computer Science;2024