HMC: Hybrid model compression method based on layer sensitivity grouping

Published:2023-10-09 Issue:10 Volume:18 Page:e0292517
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Yang Guoliang, Yu Shuaiying, Yang Hao, Nie Ziling, Wang Jixiang

Abstract

Previous studies have shown that deep models are often over-parameterized, and this parameter redundancy makes deep compression possible. The redundancy of model weights typically manifests as low rank and sparsity. Ignoring either of these two characteristics, or the different ways they are distributed across the model, leads to low accuracy and a low compression rate. To make full use of the difference between low rank and sparsity, we propose a unified framework that combines low-rank tensor decomposition with structured pruning: a hybrid model compression method based on sensitivity grouping (HMC). This framework unifies the existing additive hybrid compression method (AHC) and our proposed non-additive hybrid compression method (NaHC) in one model. NaHC groups the network's convolutional layers according to how sensitive each layer is to the different compression methods, which integrates the low rank and sparsity of the model better than AHC does. Experiments show that, when compressing the ResNet family of models, our approach achieves a better trade-off between test accuracy and compression ratio than other recent compression methods that use a single strategy or additive hybrid compression.
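
The grouping idea in the abstract can be illustrated with a small sketch. The abstract does not say how layer sensitivity is measured, so everything here is an assumption made for illustration: sensitivity is approximated by the relative reconstruction error of a layer's weights, low-rank compression is a truncated SVD, structured pruning zeroes the output filters with the smallest L2 norm, and both are compared at the same parameter budget. The function names and the `budget` parameter are hypothetical and are not taken from the paper.

```python
import numpy as np


def low_rank_error(w, rank):
    """Relative Frobenius error of the best rank-`rank` approximation (truncated SVD)."""
    u, s, vt = np.linalg.svd(w, full_matrices=False)
    approx = (u[:, :rank] * s[:rank]) @ vt[:rank, :]
    return np.linalg.norm(w - approx) / np.linalg.norm(w)


def pruning_error(w, keep_rows):
    """Relative Frobenius error after zeroing all but the `keep_rows` largest-norm rows."""
    norms = np.linalg.norm(w, axis=1)
    keep = np.argsort(norms)[-keep_rows:]
    pruned = np.zeros_like(w)
    pruned[keep] = w[keep]
    return np.linalg.norm(w - pruned) / np.linalg.norm(w)


def group_layers(layers, budget=0.5):
    """Assign each layer to the compression method that distorts it less.

    `layers` maps a layer name to its weight array; convolution kernels are
    flattened to (out_channels, in_channels * kh * kw). `budget` is the
    fraction of the original parameters each method may keep (an assumption
    of this sketch, not a quantity defined in the abstract).
    """
    groups = {"low_rank": [], "prune": []}
    for name, weight in layers.items():
        w = np.asarray(weight, dtype=float).reshape(weight.shape[0], -1)
        m, n = w.shape
        # A rank-k factorization stores k * (m + n) values; keeping k rows stores k * n.
        rank = max(1, int(budget * m * n / (m + n)))
        keep_rows = max(1, int(round(budget * m)))
        if low_rank_error(w, rank) <= pruning_error(w, keep_rows):
            groups["low_rank"].append(name)
        else:
            groups["prune"].append(name)
    return groups
```

Matching the parameter budget matters in this sketch: if both methods kept the same number k of components, the rank-k SVD would never lose, because a matrix with only k nonzero rows is itself of rank at most k. A real sensitivity measure would more likely use the accuracy drop after compressing each layer, which this weight-only proxy does not capture.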

Funder

Jiangxi Provincial Department of Education

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary
