Impact of high-level transformations within the ROCCC framework

Author:

Buyukkurt Betul1,Cortes John1,Villarreal Jason1,Najjar Walid A.1

Affiliation:

1. University of California, Riverside, Riverside, CA

Abstract

Reconfigurable computers, where one or more FPGAs are attached to a conventional microprocessor, are promising platforms for code acceleration. Despite their advantages, programmability concerns and the lack of efficient design tools/compilers for FPGAs are preventing the technology's widespread adoption. The traditional compiler technology is microprocessor-based-systems-specific and needs to be customized and augmented to address the needs in reconfigurable computing. The challenges are several due to the resources and performance constraints for FPGAs being drastically different than those of microprocessors, and also that compiling for FPGAs requires laying the computation in space by a circuit rather than in time by a sequence of instructions. ROCCC is an optimizing C-to-VHDL compiler specifically targeting the reconfigurable computer platforms. ROCCC includes several high-level optimizations that parallelize and optimize the source code for minimized area and critical path length and maximized throughput. This article presents the effect of ROCCC's high-level transformations on the performance of the generated VHDL output. ROCCC utilizes: (1) several array access optimizations to eliminate redundant memory accesses, (2) procedure-level optimizations to achieve circuit area reductions of up to 88% compared to circuit areas generated from unoptimized codes, (3) loop-level optimizations to increase the throughput, and (4) transformations unique to certain classes of applications. The preceding listed features help ROCCC generate circuits with very large degrees of parallelism capable of very high computation rates.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Reference36 articles.

1. Aigner G. Diwan A. Heine D. L. Lam M. S. Moore D. L. Murphy B. R. and Sapuntzakis C. 2010. An overview of the SUIF2 compiler infrastructure. Tech. rep. Computer Systems Laboratory Stanford University. Aigner G. Diwan A. Heine D. L. Lam M. S. Moore D. L. Murphy B. R. and Sapuntzakis C. 2010. An overview of the SUIF2 compiler infrastructure. Tech. rep. Computer Systems Laboratory Stanford University.

2. Altera. 2010a. Automated generation of hardware accelerators with direct memory access from ANSI/ISO standard C functions. http://www.altera.com/literature/wp/wp-aghrdwr.pdf. Altera. 2010a. Automated generation of hardware accelerators with direct memory access from ANSI/ISO standard C functions. http://www.altera.com/literature/wp/wp-aghrdwr.pdf.

3. Altera. 2010b. Nios II C2H user guide. http://www.altera.com/literature/ug/ug_nios2_c2h_compiler.pdf. Altera. 2010b. Nios II C2H user guide. http://www.altera.com/literature/ug/ug_nios2_c2h_compiler.pdf.

4. Spatial computation

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. EASY;Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays;2019-02-20

2. Big Data and HPC Acceleration with Vivado HLS;FPGAs for Software Programmers;2016

3. A Quick Tour of High-Level Synthesis Solutions for FPGAs;FPGAs for Software Programmers;2016

4. High-Level Synthesis;FPGAs for Software Programmers;2016

5. The LEAP FPGA Operating System;FPGAs for Software Programmers;2016

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3