Advances in the Neural Network Quantization: A Comprehensive Review-Reference-Cited by-同舟云学术

Advances in the Neural Network Quantization: A Comprehensive Review

Published:2024-08-23 Issue:17 Volume:14 Page:7445
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Wei Lu¹²,Ma Zhong²^ORCID,Yang Chaojie²^ORCID,Yao Qin¹²

Affiliation:

1. School of Software, Northwestern Polytechnical University, Xi’an 710072, China

2. Xi’an Microelectronics Technology Institute, Xi’an 710065, China

Abstract

Artificial intelligence technologies based on deep convolutional neural networks and large language models have made significant breakthroughs in many tasks, such as image recognition, target detection, semantic segmentation, and natural language processing, but also face a conflict between the high computational capacity of the algorithms and limited deployment resources. Quantization, which converts floating-point neural networks into low-bit-width integer networks, is an important and essential technique for efficient deployment and cost reduction in edge computing. This paper analyzes various existing quantization methods, showcases the deployment accuracy of advanced techniques, and discusses the future challenges and trends in this domain.

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/17/7445/pdf

Reference63 articles.

1. Liu, Z., Hu, H., Lin, Y., Yao, Z., Xie, Z., Wei, Y., Ning, J., Cao, Y., Zhang, Z., and Dong, L. (2022, January 18–24). Swin transformer v2: Scaling up capacity and resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.

2. Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11–17). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE International Conference on Computer Vision, Montreal, BC, Canada.

3. Zhang, H., Li, F., Liu, S., Zhang, L., Su, H., Zhu, J., Ni, L.M., and Shum, H.-Y. (2023). DINO: DETR with improved denoising anchor boxes for end-to-end object detection. In International Conference on Learning Representations. arXiv.

4. Zong, Z., Song, G., and Liu, Y. (2023, January 2–6). Detrs with collaborative hybrid assignments training. Proceedings of the International Conference on Computer Vision, Paris, France.

5. Unmanned system swarm intelligence and its research progresses;Zhou;Microelectron. Comput.,2021