1. Hinton, G.E., Vinyals, O., and Dean, J. (2015). Distilling the Knowledge in a Neural Network. arXiv.
2. Tian, Y., Krishnan, D., and Isola, P. (2022). Contrastive Representation Distillation. arXiv.
3. Tung, F., and Mori, G. (2019, October 27–November 2). Similarity-Preserving Knowledge Distillation. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea.
4. LeCun, Y., Denker, J., and Solla, S. (1989). Optimal Brain Damage. Proceedings of the Advances in Neural Information Processing Systems, Morgan-Kaufmann.
5. Zagoruyko, S., and Komodakis, N. (2017). Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer. arXiv.