Abstract
Achieving real-time inference is one of the major challenges in contemporary neural network applications, as increasingly complex models are deployed to mobile devices with constrained storage and computing power. Moving from a full-precision neural network model to a lower-precision representation by applying quantization techniques is a popular way to address this issue. Here, we analyze in detail and design a 2-bit uniform quantization model for the Laplacian source, chosen for its implementation simplicity, which in turn leads to shorter processing time and faster inference. The results show that the proposed model achieves high classification accuracy (more than 96% in the case of MLP and more than 98% in the case of CNN), which is competitive with the performance of other quantization solutions of almost optimal precision.
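For illustration only, the sketch below shows a symmetric 2-bit (four-level) uniform quantizer applied to Laplacian-distributed weights. The function name, the example support-region bound x_max, and the SQNR check are hypothetical and are not the paper's optimized design; the paper's actual quantizer parameters should be taken from the article itself.

```python
import numpy as np

def uniform_quantize_2bit(x, x_max):
    """Symmetric 2-bit uniform quantizer with support region [-x_max, x_max].

    The step size is delta = 2 * x_max / 4; inputs are clipped to the support
    region and mapped to the midpoints of the four quantization cells.
    """
    n_levels = 4                      # 2-bit quantizer -> 2**2 representation levels
    delta = 2.0 * x_max / n_levels    # uniform step size
    x_clipped = np.clip(x, -x_max, x_max)
    cell = np.minimum(np.floor((x_clipped + x_max) / delta), n_levels - 1)
    return -x_max + (cell + 0.5) * delta   # midpoint of the selected cell

# Example: quantize Laplacian-distributed "weights" (unit-variance Laplacian has scale 1/sqrt(2))
rng = np.random.default_rng(0)
weights = rng.laplace(loc=0.0, scale=1.0 / np.sqrt(2), size=10_000)
x_max = 1.5                           # placeholder bound, not the paper's optimized value
q_weights = uniform_quantize_2bit(weights, x_max)
sqnr_db = 10 * np.log10(np.mean(weights**2) / np.mean((weights - q_weights)**2))
print(f"levels: {np.unique(q_weights)}  SQNR: {sqnr_db:.2f} dB")
```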
Funder
Science Fund of the Republic of Serbia
Subject
General Physics and Astronomy
Cited by
9 articles.