Reconfigurable Binary Neural Network Accelerator with Adaptive Parallelism Scheme-Reference-Cited by-同舟云学术

Reconfigurable Binary Neural Network Accelerator with Adaptive Parallelism Scheme

Published:2021-01-20 Issue:3 Volume:10 Page:230
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Cho Jaechan^ORCID,Jung Yongchul^ORCID,Lee Seongjoo^ORCID,Jung Yunho^ORCID

Abstract

Binary neural networks (BNNs) have attracted significant interest for the implementation of deep neural networks (DNNs) on resource-constrained edge devices, and various BNN accelerator architectures have been proposed to achieve higher efficiency. BNN accelerators can be divided into two categories: streaming and layer accelerators. Although streaming accelerators designed for a specific BNN network topology provide high throughput, they are infeasible for various sensor applications in edge AI because of their complexity and inflexibility. In contrast, layer accelerators with reasonable resources can support various network topologies, but they operate with the same parallelism for all the layers of the BNN, which degrades throughput performance at certain layers. To overcome this problem, we propose a BNN accelerator with adaptive parallelism that offers high throughput performance in all layers. The proposed accelerator analyzes target layer parameters and operates with optimal parallelism using reasonable resources. In addition, this architecture is able to fully compute all types of BNN layers thanks to its reconfigurability, and it can achieve a higher area–speed efficiency than existing accelerators. In performance evaluation using state-of-the-art BNN topologies, the designed BNN accelerator achieved an area–speed efficiency 9.69 times higher than previous FPGA implementations and 24% higher than existing VLSI implementations for BNNs.

Funder

Institute for Information and Communications Technology Promotion

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/3/230/pdf

Reference39 articles.

1. A Survey on the New Generation of Deep Learning in Image Processing

2. A State-of-the-Art Survey on Deep Learning Theory and Architectures

3. CNN-Based Vehicle Target Recognition with Residual Compensation for Circular SAR Imaging

4. Very Deep Convolutional Networks for Large-Scale Image Recognition;Simonyan;arXiv,2014

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Lightweight Deep Learning for Resource-Constrained Environments: A Survey;ACM Computing Surveys;2024-06-24

2. FPGA Implementation of a Fault-Tolerant Fused and Branched CNN Accelerator With Reconfigurable Capabilities;IEEE Access;2024

3. BrainTTA: A 28.6 TOPS/W Compiler Programmable Transport-Triggered NN SoC;2023 IEEE 41st International Conference on Computer Design (ICCD);2023-11-06

4. A Design of BNN Accelerator using Gate-level Pipelined Self-Synchronous Circuit;2023 International Conference on IC Design and Technology (ICICDT);2023-09-25

5. Spike time displacement-based error backpropagation in convolutional spiking neural networks;Neural Computing and Applications;2023-04-19