An FPGA-Based CNN Accelerator Integrating Depthwise Separable Convolution-Reference-Cited by-同舟云学术

An FPGA-Based CNN Accelerator Integrating Depthwise Separable Convolution

Published:2019-03-03 Issue:3 Volume:8 Page:281
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Liu Bing^ORCID,Zou Danyin,Feng Lei,Feng Shou,Fu Ping,Li Junbao

Abstract

The Convolutional Neural Network (CNN) has been used in many fields and has achieved remarkable results, such as image classification, face detection, and speech recognition. Compared to GPU (graphics processing unit) and ASIC, a FPGA (field programmable gate array)-based CNN accelerator has great advantages due to its low power consumption and reconfigurable property. However, FPGA’s extremely limited resources and CNN’s huge amount of parameters and computational complexity pose great challenges to the design. Based on the ZYNQ heterogeneous platform and the coordination of resource and bandwidth issues with the roofline model, the CNN accelerator we designed can accelerate both standard convolution and depthwise separable convolution with a high hardware resource rate. The accelerator can handle network layers of different scales through parameter configuration and maximizes bandwidth and achieves full pipelined by using a data stream interface and ping-pong on-chip cache. The experimental results show that the accelerator designed in this paper can achieve 17.11GOPS for 32bit floating point when it can also accelerate depthwise separable convolution, which has obvious advantages compared with other designs.

Funder

National Natural Science Foundation of China

Open Projects Program of National Laboratory of Pattern Recognition

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/8/3/281/pdf

Reference27 articles.

1. Visualization and Interpretation of Convolutional Neural Network Predictions in Detecting Pneumonia in Pediatric Chest Radiographs

2. Vehicle-Type Detection Based on Compressed Sensing and Deep Learning in Vehicular Networks

3. ImageNet classification with deep convolutional neural networks

4. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Cited by 67 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Lightweight convolutional neural network-based plant disease identification for protection and landscape design;Crop Protection;2024-10

2. ADS-CNN: Adaptive Dataflow Scheduling for lightweight CNN accelerator on FPGAs;Future Generation Computer Systems;2024-09

3. Fpga-based SoC design for real-time facial point detection using deep convolutional neural networks with dynamic partial reconfiguration;Signal, Image and Video Processing;2024-05-14

4. Pflow: An end-to-end heterogeneous acceleration framework for CNN inference on FPGAs;Journal of Systems Architecture;2024-05

5. Diagnosis of Parkinson's Disease Using Convolutional Neural Network-Based Audio Signal Processing on FPGA;Circuits, Systems, and Signal Processing;2024-04-29