FPGA-Based Convolutional Neural Network Accelerator with Resource-Optimized Approximate Multiply-Accumulate Unit

Published: 2021-11-19, Volume: 10, Issue: 22, Page: 2859
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Cho Mannhee, Kim Youngmin

Abstract

Convolutional neural networks (CNNs) are widely used in modern applications for their versatility and high classification accuracy. Field-programmable gate arrays (FPGAs) are considered suitable platforms for CNNs because of their high performance, rapid development, and reconfigurability. Although many studies have proposed methods for implementing high-performance CNN accelerators on FPGAs using optimized data types and algorithm transformations, accelerators can be optimized further by using FPGA resources more efficiently. In this paper, we propose an FPGA-based CNN accelerator that uses multiple approximate accumulation units based on a fixed-point data type. We implemented the LeNet-5 CNN architecture, which classifies handwritten digits from the MNIST dataset. The proposed accelerator was implemented on a Xilinx FPGA using a high-level synthesis tool. It applies an optimized fixed-point data type and loop parallelization to improve performance. Approximate operation units are implemented using FPGA logic resources instead of high-precision digital signal processing (DSP) blocks, which are inefficient for low-precision data. Compared to a floating-point design, our accelerator model achieves 66% less memory usage and approximately 50% lower network latency; compared to general fixed-point designs, its resource utilization is optimized to use 78% fewer DSP blocks.
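
The fixed-point and approximate multiply-accumulate ideas in the abstract can be illustrated with a small software sketch. The abstract does not state the bit widths or the approximation scheme, so everything specific here is an assumption: a 16-bit word with 8 fractional bits, and operand truncation as a stand-in for the approximate unit. The function names (`to_fixed`, `exact_mac`, `approx_mac`, `dot`) are hypothetical and not taken from the paper.

```python
def to_fixed(x, frac_bits=8, total_bits=16):
    """Quantize a real number to a signed fixed-point integer, saturating on overflow.

    frac_bits and total_bits are assumed values, not the paper's.
    """
    scaled = int(round(x * (1 << frac_bits)))
    lo = -(1 << (total_bits - 1))
    hi = (1 << (total_bits - 1)) - 1
    return max(lo, min(hi, scaled))


def exact_mac(acc, a, b):
    """Exact integer multiply-accumulate on fixed-point operands."""
    return acc + a * b


def approx_mac(acc, a, b, drop_bits=4):
    """Approximate MAC (assumed scheme): clear the low bits of each operand
    before multiplying. The arithmetic shift rounds toward negative infinity."""
    a_t = (a >> drop_bits) << drop_bits
    b_t = (b >> drop_bits) << drop_bits
    return acc + a_t * b_t


def dot(weights, inputs, mac, frac_bits=8):
    """Fixed-point dot product; the accumulator carries 2 * frac_bits fractional bits."""
    acc = 0
    for w, x in zip(weights, inputs):
        acc = mac(acc, to_fixed(w, frac_bits), to_fixed(x, frac_bits))
    return acc / (1 << (2 * frac_bits))


weights = [0.5, -0.25, 0.75]
inputs = [1.0, 2.0, -1.5]
exact = dot(weights, inputs, exact_mac)
approx = dot(weights, inputs, approx_mac)
print(exact, approx)
```

Clearing low operand bits shrinks the multiplier that has to be built, which is the kind of trade-off that makes a logic-resource implementation plausible for low-precision data; the actual design in the paper may use a different approximation.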

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/22/2859/pdf

References: 38 articles.

1. Mobilenets: Efficient convolutional neural networks for mobile vision applications; Howard; arXiv, 2017

2. Convolutional neural networks: an overview and application in radiology

Cited by 17 articles.

1. A new multi-objective hyperparameter optimization algorithm for COVID-19 detection from x-ray images; Soft Computing; 2024-07-23

2. Real-time edge computing design for physiological signal analysis and classification; Biomedical Physics & Engineering Express; 2024-06-04

3. Construction and Application of a Neuromorphic Circuit With Excitatory and Inhibitory Post-Synaptic Conduction Channels Implemented Using Dual-Gate Thin-Film Transistors; IEEE Transactions on Circuits and Systems I: Regular Papers; 2024-04

4. A Configurable Approximate Multiplier for CNNs Using Partial Product Speculation; 2024 Design, Automation & Test in Europe Conference & Exhibition (DATE); 2024-03-25

5. Stochastic Computing Convolution Neural Network Architecture Reinvented For Highly Efficient Artificial Intelligence Workload on Field Programmable Gate Array; Research; 2024-01-08