On Predictable Reconfigurable System Design

Author:

Voss Nils1,Kwaadgras Bastiaan2,Mencer Oskar2,Luk Wayne3,Gaydadjiev Georgi4

Affiliation:

1. Imperial College London, UK and Maxeler Technologies, London, UK

2. Maxeler Technologies, London, UK

3. Imperial College London, London, UK

4. Maxeler IoT-Labs and Imperial College London and Univ. of Groningen

Abstract

We propose a design methodology to facilitate rigorous development of complex applications targeting reconfigurable hardware. Our methodology relies on analytical estimation of system performance and area utilisation for a given specific application and a particular system instance consisting of a controlflow machine working in conjunction with one or more reconfigurable dataflow accelerators. The targeted application is carefully analyzed, and the parts identified for hardware acceleration are reimplemented as a set of representative software models. Next, with the results of the application analysis, a suitable system architecture is devised and its performance is evaluated to determine bottlenecks, allowing predictable design. The architecture is iteratively refined, until the final version satisfying the specification requirements in terms of performance and required hardware area is obtained. We validate the presented methodology using a widely accepted convolutional neural network (VGG-16) and an important HPC application (BQCD). In both cases, our methodology relieved and alleviated all system bottlenecks before the hardware implementation was started. As a result the architectures were implemented first time right, achieving state-of-the-art performance within 15% of our modelling estimations.

Funder

Maxeler, Intel, and Xilinx is gratefully acknowledged

United Kingdom EPSRC

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Reference93 articles.

1. Gzip on a chip

2. Alibaba. [n.d.]. AliBaba f2 Instance. Retrieved from https://www.alibabacloud.com/help/doc-detail/25378.htm#concept-sx4-lxv-tdb-f2. Alibaba. [n.d.]. AliBaba f2 Instance. Retrieved from https://www.alibabacloud.com/help/doc-detail/25378.htm#concept-sx4-lxv-tdb-f2.

3. Amazon. [n.d.]. Amazon F1 Instance. Retrieved from https://aws.amazon.com/ec2/instance-types/f1/. Amazon. [n.d.]. Amazon F1 Instance. Retrieved from https://aws.amazon.com/ec2/instance-types/f1/.

4. J. Arram K. H. Tsoi Wayne Luk and P. Jiang. 2013. Hardware acceleration of genetic sequence alignment. In Reconfigurable Computing: Architectures Tools and Applications. Springer Berlin 13--24. J. Arram K. H. Tsoi Wayne Luk and P. Jiang. 2013. Hardware acceleration of genetic sequence alignment. In Reconfigurable Computing: Architectures Tools and Applications. Springer Berlin 13--24.

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. ExaFlexHH: an exascale-ready, flexible multi-FPGA library for biologically plausible brain simulations;Frontiers in Neuroinformatics;2024-04-12

2. FPGA Acceleration for HPC Supercapacitor Simulations;Proceedings of the Platform for Advanced Scientific Computing Conference;2023-06-26

3. Distributed large-scale graph processing on FPGAs;Journal of Big Data;2023-06-04

4. Demonstration of FPGA Acceleration of Monte Carlo Simulation;Journal of Physics: Conference Series;2023-02-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3