Advances in the Neural Network Quantization: A Comprehensive Review
-
Published:2024-08-23
Issue:17
Volume:14
Page:7445
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Wei Lu12, Ma Zhong2ORCID, Yang Chaojie2ORCID, Yao Qin12
Affiliation:
1. School of Software, Northwestern Polytechnical University, Xi’an 710072, China 2. Xi’an Microelectronics Technology Institute, Xi’an 710065, China
Abstract
Artificial intelligence technologies based on deep convolutional neural networks and large language models have made significant breakthroughs in many tasks, such as image recognition, target detection, semantic segmentation, and natural language processing, but also face a conflict between the high computational capacity of the algorithms and limited deployment resources. Quantization, which converts floating-point neural networks into low-bit-width integer networks, is an important and essential technique for efficient deployment and cost reduction in edge computing. This paper analyzes various existing quantization methods, showcases the deployment accuracy of advanced techniques, and discusses the future challenges and trends in this domain.
Reference63 articles.
1. Liu, Z., Hu, H., Lin, Y., Yao, Z., Xie, Z., Wei, Y., Ning, J., Cao, Y., Zhang, Z., and Dong, L. (2022, January 18–24). Swin transformer v2: Scaling up capacity and resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA. 2. Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11–17). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE International Conference on Computer Vision, Montreal, BC, Canada. 3. Zhang, H., Li, F., Liu, S., Zhang, L., Su, H., Zhu, J., Ni, L.M., and Shum, H.-Y. (2023). DINO: DETR with improved denoising anchor boxes for end-to-end object detection. In International Conference on Learning Representations. arXiv. 4. Zong, Z., Song, G., and Liu, Y. (2023, January 2–6). Detrs with collaborative hybrid assignments training. Proceedings of the International Conference on Computer Vision, Paris, France. 5. Unmanned system swarm intelligence and its research progresses;Zhou;Microelectron. Comput.,2021
|
|