Affiliation:
1. SYS Institute, Università della Svizzera Italiana, Lugano, Switzerland
2. EPFL, Lausanne, Switzerland
3. Micron Technology, San Jose, CA, USA
Abstract
Coarse-Grain Reconfigurable Arrays (CGRAs) are emerging low-power architectures designed to accelerate Compute-Intensive Loops (CILs). The acceleration a CGRA provides depends on the quality of the mapping, that is, on how efficiently the CIL is compiled onto the platform. State-of-the-Art (SoA) compilation techniques use modulo scheduling to minimize the Iteration Interval (II) and graph algorithms such as Max-Clique Enumeration to solve the mapping problem. Our work instead approaches the mapping problem through a satisfiability (SAT) formulation. We introduce the Kernel Mobility Schedule (KMS), an ad hoc schedule used together with the Data Flow Graph and CGRA architectural information to generate Boolean statements that, when satisfied, yield a valid mapping. Experimental results show SAT-MapIt outperforming SoA alternatives in almost 50% of the explored benchmarks. Additionally, we evaluated the mapping results on a synthesizable CGRA design and highlighted trends in runtime metrics, i.e., energy efficiency and latency, across different CILs and CGRA sizes. We show that a hardware-agnostic analysis performed on compiler-level metrics can optimally prune the architectural design space while still retaining Pareto-optimal configurations. Moreover, by exploring how implementation details impact cost and performance on real hardware, we highlight the importance of holistic software-to-hardware mapping flows, such as the one presented herein.
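As a rough illustration of what a SAT formulation of the mapping problem can look like, the sketch below encodes a toy instance in Python. It is not the encoding used by SAT-MapIt: the per-node cycle lists in `MOBILITY` only stand in for the information a KMS would provide, the 2x2 mesh in `NEIGHBOURS`, the three-node chain in `EDGES`, and the helper names `solve` and `map_loop` are all hypothetical, and routing and register constraints are ignored. Each Boolean variable means "node n runs on PE p at cycle t", and the clauses require one placement per node, no two nodes on the same PE in the same slot modulo II, and consumers placed later than their producers on the same or a neighbouring PE.

```python
from itertools import combinations


def solve(clauses, assignment=None):
    """Tiny DPLL-style search; returns {var: bool} or None if unsatisfiable."""
    assignment = dict(assignment or {})
    remaining = []
    for clause in clauses:
        if any(assignment.get(abs(lit)) == (lit > 0) for lit in clause):
            continue  # clause already satisfied
        rest = [lit for lit in clause if abs(lit) not in assignment]
        if not rest:
            return None  # clause falsified under this assignment
        remaining.append(rest)
    if not remaining:
        return assignment
    # Prefer a unit clause, otherwise branch on the first open literal.
    lit = next((c[0] for c in remaining if len(c) == 1), remaining[0][0])
    for value in (lit > 0, lit < 0):
        model = solve(remaining, {**assignment, abs(lit): value})
        if model is not None:
            return model
    return None


def map_loop(mobility, edges, neighbours, ii):
    """Return {node: (pe, cycle)}, or None if no mapping exists at this II."""
    # One Boolean variable per candidate placement (node, PE, cycle).
    placements = {n: [(p, t) for p in neighbours for t in cycles]
                  for n, cycles in mobility.items()}
    var = {}
    for n, cands in placements.items():
        for p, t in cands:
            var[(n, p, t)] = len(var) + 1
    clauses = []
    for n, cands in placements.items():
        ids = [var[(n, p, t)] for p, t in cands]
        clauses.append(ids)  # at least one placement per node
        clauses += [[-a, -b] for a, b in combinations(ids, 2)]  # at most one
    # Resource constraint: a PE executes one node per slot (cycle mod II).
    for n1, n2 in combinations(placements, 2):
        for p1, t1 in placements[n1]:
            for p2, t2 in placements[n2]:
                if p1 == p2 and t1 % ii == t2 % ii:
                    clauses.append([-var[(n1, p1, t1)], -var[(n2, p2, t2)]])
    # Dependency constraint (simplified): the consumer runs strictly later,
    # on the producer's PE or one of its neighbours.
    for u, v in edges:
        for p1, t1 in placements[u]:
            for p2, t2 in placements[v]:
                if not (t2 > t1 and p2 in neighbours[p1]):
                    clauses.append([-var[(u, p1, t1)], -var[(v, p2, t2)]])
    model = solve(clauses)
    if model is None:
        return None
    return {n: (p, t) for (n, p, t), i in var.items() if model.get(i, False)}


# Hypothetical toy instance: 2x2 mesh (row-major PE ids), chain a -> b -> c.
NEIGHBOURS = {0: {0, 1, 2}, 1: {0, 1, 3}, 2: {0, 2, 3}, 3: {1, 2, 3}}
MOBILITY = {"a": [0], "b": [1, 2], "c": [2, 3]}
EDGES = [("a", "b"), ("b", "c")]

mapping = map_loop(MOBILITY, EDGES, NEIGHBOURS, ii=2)
print(mapping)
```

In a flow of this kind, a mapper would typically start from the smallest candidate II and increase it until the solver reports a satisfying assignment; a production implementation would hand the clauses to a dedicated SAT solver rather than the toy search above.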
Publisher
Association for Computing Machinery (ACM)