Pruning Deep Neural Network Models via Minimax Concave Penalty Regression

Authors:

Liu Xinggu¹, Zhou Lin¹, Luo Youxi¹

Affiliation:

1. School of Science, Hubei University of Technology, Wuhan 430068, China

Abstract

In this study, we propose a filter pruning method based on MCP (Minimax Concave Penalty) regression. The convolution operation is formulated as a linear regression, and the regression coefficients serve as indicators of channel redundancy. In feature selection, nonconvex sparse penalties such as MCP generally outperform Lasso regression. Building on this insight, we introduce MCP regression to screen convolutional channels and, combined with the coordinate descent method, achieve model compression. In both single-layer and global pruning experiments, the Top-1 loss of the MCP regression compression method is consistently smaller than that of the Lasso regression compression method across diverse models. Specifically, at a global pruning ratio of 0.3, the Top-1 accuracy of the MCP regression compression method improves on that of the Lasso regression compression method by 0.21% and 1.67% under the VGG19_Simple and VGG19 models, respectively. Similarly, for ResNet34, at two distinct pruning ratios, the Top-1 accuracy improves by 0.33% and 0.26%. Finally, we compare and discuss the proposed methods in terms of both time and space resource consumption.
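The pipeline the abstract describes — treat the channel responses as regressors, penalize their coefficients with MCP, solve by coordinate descent, and mark channels whose coefficients shrink to exactly zero as redundant — can be sketched as follows. This is a minimal illustration with hypothetical function names, not the authors' implementation; it assumes standardized (zero-mean, unit-variance) feature columns, under which the MCP coordinate update has the closed form used in `mcp_threshold`.

```python
import numpy as np

def mcp_threshold(z, lam, gamma):
    """MCP thresholding operator for a univariate LS estimate z
    (assumes unit-variance feature columns; requires gamma > 1)."""
    if abs(z) <= gamma * lam:
        # soft-threshold, then rescale to undo MCP's reduced shrinkage
        return np.sign(z) * max(abs(z) - lam, 0.0) / (1.0 - 1.0 / gamma)
    return z  # beyond gamma*lam the MCP estimate is unbiased

def mcp_coordinate_descent(X, y, lam=0.2, gamma=3.0, n_iter=200):
    """Minimize (1/2n)||y - X b||^2 + sum_j MCP(b_j; lam, gamma)
    by cyclic coordinate descent; columns of X must be standardized."""
    n, p = X.shape
    beta = np.zeros(p)
    resid = y - X @ beta
    for _ in range(n_iter):
        for j in range(p):
            # put coordinate j back into the residual
            partial = resid + X[:, j] * beta[j]
            z = X[:, j] @ partial / n  # univariate least-squares fit
            b_new = mcp_threshold(z, lam, gamma)
            resid = partial - X[:, j] * b_new
            beta[j] = b_new
    return beta

def redundant_channels(beta):
    """Channels whose MCP coefficient is exactly zero are pruning candidates."""
    return [j for j, b in enumerate(beta) if b == 0.0]
```

Because MCP applies no shrinkage to coefficients whose magnitude exceeds `gamma * lam`, strong channels keep nearly unbiased weights while weak ones are driven exactly to zero, which is the property that motivates preferring it over the Lasso for channel screening.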

Funder

National Social Science Fund of China

National Natural Science Foundation of China

Key Humanities and Social Science Fund of Hubei Provincial Department of Education

Publisher

MDPI AG

