Abstract
Convolutional neural networks (CNNs) have achieved strong performance in many practical applications. However, their high computational and storage requirements make them difficult to deploy on resource-constrained devices. To address this issue, we propose a novel iterative structured pruning algorithm for CNNs based on recursive least squares (RLS) optimization. Our algorithm combines the inverse input autocorrelation matrix with the weight matrix of each CNN layer to evaluate and prune unimportant input channels or nodes, and performs the next pruning step once the test loss has recovered to its level before the previous pruning. Our algorithm can also be used to prune feedforward neural networks (FNNs). The fast convergence of RLS optimization allows our algorithm to prune CNNs and FNNs multiple times within a small number of epochs. We validate its effectiveness by pruning VGG-16 and ResNet-50 on CIFAR-10 and CIFAR-100, and by pruning a three-layer FNN on MNIST. Compared with four popular pruning algorithms, our algorithm adaptively prunes CNNs according to the difficulty of the learning task, and effectively prunes CNNs and FNNs with little or no loss in accuracy. In addition, our algorithm can prune the original sample features in the input layer.
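The central objects in the abstract are the inverse input autocorrelation matrix maintained by the RLS recursion and a per-channel importance score that combines it with the layer's weights. The sketch below illustrates this idea in NumPy: the `rls_update` step is the standard RLS recursion with forgetting factor `lam`, while `channel_scores` is a hypothetical stand-in for the paper's criterion (the abstract does not specify the exact formula), scoring channel j by the weight norm it feeds divided by the j-th diagonal of the inverse autocorrelation matrix.

```python
import numpy as np

def rls_update(P, x, lam=0.99):
    """One recursive least squares (RLS) update of the inverse input
    autocorrelation matrix P after a layer sees input vector x."""
    Px = P @ x
    k = Px / (lam + x @ Px)          # RLS gain vector
    P = (P - np.outer(k, Px)) / lam  # rank-1 downdate plus forgetting
    return P

def channel_scores(P, W):
    """Hypothetical importance score per input channel: the norm of the
    weights fed by channel j, scaled by 1 / P[j, j]. This is an
    illustrative assumption, not the paper's exact criterion."""
    w_norm = np.linalg.norm(W, axis=0)   # ||W[:, j]|| for each input channel
    return w_norm / np.diag(P)           # small score => pruning candidate

# Toy usage: 8 input channels feeding 4 output nodes of one layer.
rng = np.random.default_rng(0)
P = np.eye(8)
W = rng.normal(size=(4, 8))
for _ in range(100):
    P = rls_update(P, rng.normal(size=8))
prune_idx = np.argsort(channel_scores(P, W))[:2]  # two least important channels
print("channels to prune:", prune_idx)
```

In the iterative scheme the abstract describes, such a pruning step would be repeated only after retraining has brought the test loss back down to its level before the previous pruning, so each round starts from a comparably well-fitted model.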
Funder
National Natural Science Foundation of China
Publisher
Springer Science and Business Media LLC
Cited by
1 article.