Hardware Platform-Aware Binarized Neural Network Model Optimization-Reference-Cited by-同舟云学术

Hardware Platform-Aware Binarized Neural Network Model Optimization

Published:2022-01-26 Issue:3 Volume:12 Page:1296
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Vo Quang Hieu,Asim Faaiz,Alimkhanuly Batyrbek,Lee Seunghyun^ORCID,Kim Lokwon

Abstract

Deep Neural Networks (DNNs) have shown superior accuracy at the expense of high memory and computation requirements. Optimizing DNN models regarding energy and hardware resource requirements is extremely important for applications with resource-constrained embedded environments. Although using binary neural networks (BNNs), one of the recent promising approaches, significantly reduces the design’s complexity, accuracy degradation is inevitable when reducing the precision of parameters and output activations. To balance between implementation cost and accuracy, in addition to proposing specialized hardware accelerators for corresponding specific network models, most recent software binary neural networks have been optimized based on generalized metrics, such as FLOPs or MAC operation requirements. However, with the wide range of hardware available today, independently evaluating software network structures is not good enough to determine the final network model for typical devices. In this paper, an architecture search algorithm based on estimating the hardware performance at the design time is proposed to achieve the best binary neural network models for hardware implementation on target platforms. With the XNOR-net used as a base architecture and target platforms, including Field Programmable Gate Array (FPGA), Graphic Processing Unit (GPU), and Resistive Random Access Memory (RRAM), the proposed algorithm shows its efficiency by giving more accurate estimation for the hardware performance at the design time than FLOPs or MAC operations.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/3/1296/pdf

Reference60 articles.

1. Deep Learning for Computer Vision: A Brief Review

2. Speech Recognition Using Deep Neural Networks: A Systematic Review

3. Optimal brain damage;LeCun,1990

4. Learning both weights and connections for efficient neural networks;Han;arXiv,2015

5. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding;Han;arXiv,2015

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Systematic Literature Review on Binary Neural Networks;IEEE Access;2023