How to efficiently implement dynamic circuit specialization systems

Author:

Abouelella Fatma1,Davidson Tom1,Meeus Wim1,Bruneel Karel1,Stroobandt Dirk1

Affiliation:

1. Ghent University, Ghent, Belgium

Abstract

Dynamic circuit specialization (DCS) is a technique used to implement FPGA applications where some of the input data, called parameters, change slowly compared to other inputs. Each time the parameter values change, the FPGA is reconfigured by a configuration that is specialized for those new parameter values. This specialized configuration is much smaller and faster than a regular configuration. However, the overhead associated with the specialization process should be minimized to achieve the desired benefits of using the DCS technique. This overhead is represented by both the FPGA resources needed to specialize the FPGA at runtime and by the specialization time. The introduction of parameterized configurations [Bruneel and Stroobandt 2008] has improved the efficiency of DCS implementations. However, the specialization overhead still takes a considerable amount of resources and time. In this article, we explore how to efficiently build DCS systems by presenting a variety of possible solutions for the specialization process and the overhead associated with each of them. We split the specialization process into two main phases: the evaluation and the configuration phase. The PowerPC embedded processor, the MicroBlaze, and a customized processor (CP) are used as alternatives in the evaluation phase. In the configuration phase, the ICAP and a custom configuration interface (SRL configuration) are used as alternatives. Each solution is used to implement a DCS system for three applications: an adaptive finite impulse response (FIR) filter, a ternary content-addressable memory (TCAM), and a regular expression matcher (RegEx). The experiments show that the use of our CP along with the SRL configuration achieves minimum overhead in terms of resources and time. Our CP is 1.8 and 3.5 times smaller than the PowerPC and the area-optimized implementation of the MicroBlaze, respectively. Moreover, the use of the CP enables a more compact representation for the parameterized configuration in comparison to both the PowerPC and the MicroBlaze processors. For instance, in the FIR, the parameterized configuration compiled for our CP is 6--7 times smaller than that for the embedded processors.

Funder

Seventh Framework Programme

Publisher

Association for Computing Machinery (ACM)

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Science Applications

Reference34 articles.

1. Efficiently Generating FPGA Configurations through a Stack Machine

2. Abouelella F. Bruneel K. and Stroobandt D. 2010b. Towards a more efficient run-time FPGA configuration generation. In Parallel Computing: From Multicores and GPU's to Petascale. IOS Press Amsterdam 624--631. Abouelella F. Bruneel K. and Stroobandt D. 2010b. Towards a more efficient run-time FPGA configuration generation. In Parallel Computing: From Multicores and GPU's to Petascale. IOS Press Amsterdam 624--631.

3. Automatic tool flow for shift-register-LUT reconfiguration

4. Altera. 2001. Application note 119: Implementing high-speed search applications with Altera CAM. Altera San Jose CA. Altera. 2001. Application note 119: Implementing high-speed search applications with Altera CAM. Altera San Jose CA.

5. Altera. 2008. FPGA run-time reconfiguration: Two approaches. Altera San Jose CA. Altera. 2008. FPGA run-time reconfiguration: Two approaches. Altera San Jose CA.

Cited by 6 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. An Integrated Approach and Tool Support for the Design of FPGA-Based Multi-Grain Reconfigurable Systems;IEEE Access;2020

2. Reconfigurable FPGA Implementation of the AVC Quantiser and De-quantiser Blocks;Advanced Concepts for Intelligent Vision Systems;2018

3. TCONMAP;ACM Transactions on Design Automation of Electronic Systems;2015-09-28

4. Avoiding transitional effects in dynamic circuit specialisation on FPGAs;Proceedings of the 52nd Annual Design Automation Conference;2015-06-07

5. Enabling FPGA routing configuration sharing in dynamic partial reconfiguration;Design Automation for Embedded Systems;2014-09-18

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3