The Compression Techniques Applied on Deep Learning Model

Published: 2022-07-26 Volume: 4 Pages: 325-331
ISSN: 2791-0210
Container-title: Highlights in Science, Engineering and Technology
Short-container-title: HSET

Author:

He Haoyuan, Huang Lingxuan, Huang Zisen, Yang Tiantian

Abstract

In recent years, smartphone penetration has approached saturation, and artificial intelligence has emerged as a cutting-edge technology capable of triggering disruptive change. Deep neural networks are now starting to appear on mobile devices. Better performance generally requires more complex networks, so model size, computation, and storage requirements keep growing, while resource allocation and energy consumption remain challenges on mobile devices. Techniques for compressing deep learning models are therefore important, and this paper surveys a body of related literature. It reviews compression techniques for deep neural networks and introduces the key operating points of knowledge distillation, together with the effect of the network model on the learning performance of Resolution-Aware Knowledge Distillation. A low-rank decomposition algorithm is evaluated in terms of parameter sparsity and rank, using the extended Bayesian information criterion (BIC) for tuning parameter selection. The paper discusses how pruning strategies reduce redundancy in the fully connected and convolutional layers of the network model. Finally, it presents quantization techniques, including a neural network that quantizes weights and activations by applying differentiable nonlinear functions.
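As a rough illustration of three of the technique families named in the abstract (pruning, quantization, and knowledge distillation), the following is a minimal standard-library sketch. It is not the paper's method: the function names, the 50% sparsity level, the 8-bit uniform quantizer (used here in place of the differentiable nonlinear quantizer the abstract mentions), and the temperature of 2.0 are all assumptions chosen for illustration.

```python
import math

def magnitude_prune(weights, sparsity):
    """Zero the smallest-magnitude fraction `sparsity` of the weights."""
    k = int(round(sparsity * len(weights)))
    if k == 0:
        return list(weights)
    threshold = sorted(abs(w) for w in weights)[k - 1]
    # Ties at the threshold are pruned too, so slightly more than k may be zeroed.
    return [w if abs(w) > threshold else 0.0 for w in weights]

def quantize_8bit(weights):
    """Uniform affine quantization to integer codes in 0..255."""
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / 255.0 or 1.0  # fall back to 1.0 for constant input
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo

def dequantize(codes, scale, lo):
    """Map integer codes back to approximate floating-point weights."""
    return [c * scale + lo for c in codes]

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened outputs, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical toy weights, for illustration only.
weights = [0.8, -0.05, 0.3, -0.6, 0.01, 0.2]
pruned = magnitude_prune(weights, 0.5)
codes, scale, lo = quantize_8bit(weights)
restored = dequantize(codes, scale, lo)
```

In this toy setting, pruning keeps only the three largest-magnitude weights, and the round trip through 8-bit codes changes each weight by at most about half of one quantization step (`scale / 2`). The distillation loss is zero when the student's logits match the teacher's and grows as their softened output distributions diverge.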

Publisher

Darcy & Roy Press Co. Ltd.

References: 20 articles.

1. G. Hinton, O. Vinyals and J. Dean, "Distilling the Knowledge in a Neural Network," Computer Science, 2015.

2. X. Chen, Z. Q. Xing and Y. Y. Cheng, "Introduction to Model Compression Knowledge Distillation," 2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP), 2021, pp. 1464-1467, doi: 10.1109/ICSP51882.2021.9408881.

3. I.-H. Shin, Y.-H. Moon and Y.-J. Lee, "Towards Understanding Architectural Effects on Knowledge Distillation," 2020 International Conference on Information and Communication Technology Convergence (ICTC), 2020, pp. 1144-1146, doi: 10.1109/ICTC49870.2020.9289630.

4. H. Ni, J. Shen and C. Yuan, "Enhanced Knowledge Distillation for Face Recognition," 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), 2019, pp. 1441-1444, doi: 10.1109/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00207.

5. Z. Feng, J. Lai and X. Xie, "Resolution-Aware Knowledge Distillation for Efficient Inference," in IEEE Transactions on Image Processing, vol. 30, pp. 6985-6996, 2021, doi: 10.1109/TIP.2021.3101158.

Cited by 1 article.

1. "A comprehensive review of model compression techniques in machine learning," Applied Intelligence, 2024-09-02.