Optimization of Energy Efficiency for FPGA-Based Convolutional Neural Networks Accelerator

Author:

Tang Yongming,Dai Rongshi,Xie Yi

Abstract

Abstract Convolutional neural network (CNN) is widely applied to image recognition with high recognition accuracy. CNN has a wider implementation in general-purpose processors and can be accelerated on FPGA. CNN has a unique way of computing, but general-purpose processors are not efficient for CNN and cannot meet energy efficiency requirements. And the previous studies on FPGA did not involve an energy-efficient implementation on FPGA. We innovatively propose energy efficiency models and implement high energy efficiency CNN on FPGA. We implemented the LeNet-5 network model on the GENESYS 2 board and compared it to the traditional processor and previous studies. By comparison, the computing throughput of CPU, GPU and FPGA are 3.831GFLOPS, 27.143GFLOPS and 19.61GFLOPS respectively, and their powers are 32.15W, 52W, 4.152W respectively. The final energy efficiency (GFLOPS/W) is 0.119GFLOPS/W, 0.522 GFLOPS/W, 4.723 GFLOPS/W, so the energy efficiency of FPGA are far superior to that of CPU and GPU. Since the energy efficiency we achieved on FPGA is also higher than that of FPL2009 and FPGA2015, and we have achieved good experimental results in energy efficiency.

Publisher

IOP Publishing

Subject

General Physics and Astronomy

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Implementation of intelligent robot target recognition technology based on FPGA development platform;Fourth International Conference on Machine Learning and Computer Application (ICMLCA 2023);2024-05-22

2. Real-time Operational Load Monitoring of a Composite Aerostructure Using FPGA-based Computing System;Bulletin of the Polish Academy of Sciences Technical Sciences;2023-11-02

3. Comparative Energy & Hardware Analysis on Implementation of Full Subtractor Using Different FPGAs Families;2023 6th International Conference on Contemporary Computing and Informatics (IC3I);2023-09-14

4. Energy Efficient Design of Coarse-Grained Reconfigurable Architectures: Insights, Trends and Challenges;2022 International Conference on Field-Programmable Technology (ICFPT);2022-12-05

5. Designing of neural network-based SoSMC for autonomous underwater vehicle: integrating hybrid optimization approach;Soft Computing;2022-11-23

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3