A Hardware-Friendly Low-Bit Power-of-Two Quantization Method for CNNs and Its FPGA Implementation-Reference-Cited by-同舟云学术

A Hardware-Friendly Low-Bit Power-of-Two Quantization Method for CNNs and Its FPGA Implementation

Published:2022-09-01 Issue:17 Volume:22 Page:6618
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Sui Xuefu,Lv Qunbo,Bai Yang,Zhu Baoyu,Zhi Liangjie^ORCID,Yang Yuanbo,Tan Zheng

Abstract

To address the problems of convolutional neural networks (CNNs) consuming more hardware resources (such as DSPs and RAMs on FPGAs) and their accuracy, efficiency, and resources being difficult to balance, meaning they cannot meet the requirements of industrial applications, we proposed an innovative low-bit power-of-two quantization method: the global sign-based network quantization (GSNQ). This method involves designing different quantization ranges according to the sign of the weights, which can provide a larger quantization-value range. Combined with the fine-grained and multi-scale global retraining method proposed in this paper, the accuracy loss of low-bit quantization can be effectively reduced. We also proposed a novel convolutional algorithm using shift operations to replace multiplication to help to deploy the GSNQ quantized models on FPGAs. Quantization comparison experiments performed on LeNet-5, AlexNet, VGG-Net, ResNet, and GoogLeNet showed that GSNQ has higher accuracy than most existing methods and achieves “lossless” quantization (i.e., the accuracy of the quantized CNN model is higher than the baseline) at low-bit quantization in most cases. FPGA comparison experiments showed that our convolutional algorithm does not occupy on-chip DSPs, and it also has a low comprehensive occupancy in terms of on-chip LUTs and FFs, which can effectively improve the computational parallelism, and this proves that GSNQ has good hardware-adaptation capability. This study provides theoretical and experimental support for the industrial application of CNNs.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/17/6618/pdf

Reference58 articles.

1. Very Deep Convolutional Networks for Large-Scale Image Recognition;Simonyan;arXiv,2014

2. ImageNet classification with deep convolutional neural networks

3. Deep Residual Learning for Image Recognition;He;Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2016

4. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation;Girshick;arXiv,2013

5. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation;Journal of Real-Time Image Processing;2024-06-06

2. TinyEmergencyNet: a hardware-friendly ultra-lightweight deep learning model for aerial scene image classification;Journal of Real-Time Image Processing;2024-03-13

3. Adaptive Global Power-of-Two Ternary Quantization Algorithm Based on Unfixed Boundary Thresholds;Sensors;2023-12-28

4. SSiMD: Supporting Six Signed Multiplications in a DSP Block for Low-Precision CNN on FPGAs;2023 International Conference on Field Programmable Technology (ICFPT);2023-12-12

5. Quantization-Aware NN Layers with High-throughput FPGA Implementation for Edge AI;Sensors;2023-05-11