UNIQ-Reference-Cited by-同舟云学术

UNIQ

Published:2019-11-30 Issue:1-4 Volume:37 Page:1-15
ISSN:0734-2071
Container-title:ACM Transactions on Computer Systems
language:en
Short-container-title:ACM Trans. Comput. Syst.

Author:

Baskin Chaim¹,Liss Natan¹,Schwartz Eli²,Zheltonozhskii Evgenii¹^ORCID,Giryes Raja²,Bronstein Alex M.,Mendelson Avi¹

Affiliation:

1. Technion, Technion, Haifa

2. Tel Aviv University, Tel Aviv, Israel

Abstract

We present a novel method for neural network quantization. Our method, named UNIQ , emulates a non-uniform k -quantile quantizer and adapts the model to perform well with quantized weights by injecting noise to the weights at training time. As a by-product of injecting noise to weights, we find that activations can also be quantized to as low as 8-bit with only a minor accuracy degradation. Our non-uniform quantization approach provides a novel alternative to the existing uniform quantization techniques for neural networks. We further propose a novel complexity metric of number of bit operations performed (BOPs), and we show that this metric has a linear relation with logic utilization and power. We suggest evaluating the trade-off of accuracy vs. complexity (BOPs). The proposed method, when evaluated on ResNet18/34/50 and MobileNet on ImageNet, outperforms the prior state of the art both in the low-complexity regime and the high accuracy regime. We demonstrate the practical applicability of this approach, by implementing our non-uniformly quantized CNN on FPGA.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3444943

Reference35 articles.

1. Deep Learning with Low Precision by Half-Wave Gaussian Quantization

2. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

3. A Pixel Pitch-Matched Ultrasound Receiver for 3-D Photoacoustic Imaging With Integrated Delta-Sigma Beamformer in 28-nm UTBB FD-SOI

Cited by 52 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Computational Complexity Optimization of Neural Network-Based Equalizers in Digital Signal Processing: A Comprehensive Approach;Journal of Lightwave Technology;2024-06-15

2. A Trustworthiness Sequence Prediction Scheme Based on Neural Networks and Mathematical Calculations;IEEE Internet of Things Journal;2024-06-15

3. Hardware/Software Codesign of Real-Time Intrusion Detection System for Internet of Things Devices;IEEE Internet of Things Journal;2024-06-15

4. Atalanta: A Bit is Worth a “Thousand” Tensor Values;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2024-04-27

5. Pretraining a foundation model for generalizable fluorescence microscopy-based image restoration;Nature Methods;2024-04-12