Compression of Deep Convolutional Neural Network Using Additional Importance-Weight-Based Filter Pruning Approach

Published:2022-11-04 Issue:21 Volume:12 Page:11184
ISSN:2076-3417
Container-title:Applied Sciences
language:en

Author:

Sawant Shrutika S., Wiedmann Marco, Göb Stephan, Holzer Nina, Lang Elmar W., Götz Theresa

Abstract

The success of the convolutional neural network (CNN) has come with a tremendous growth of diverse CNN structures, many of which are hard to deploy on resource-limited platforms. These over-sized models contain a large number of filters in the convolutional layers, which are responsible for almost 99% of the computation. This raises a key question: do we really need all those filters? Removing entire filters can significantly reduce the computational cost. Hence, this article proposes a filter pruning method, a process of discarding a subset of unimportant or weak filters from the original CNN model, which alleviates the storage and run-time costs of over-sized CNN architectures. The proposed strategy compresses the model by assigning additional importance weights to convolutional filters. These additional importance weights help each filter learn its responsibility and contribute more efficiently. We adopted different initialization strategies to learn more about filters from different aspects and prune accordingly. Furthermore, unlike existing pruning approaches, the proposed method uses a predefined error tolerance level instead of the pruning rate. Extensive experiments on two widely used image segmentation datasets, Inria and AIRS, and two widely known CNN models for segmentation, TernausNet and standard U-Net, verify that our pruning approach can efficiently compress CNN models with negligible or no loss of accuracy. For instance, our approach removes 85% of all floating point operations (FLOPs) from TernausNet on Inria with a negligible drop of 0.32% in validation accuracy. This compressed network is six times smaller and almost seven times faster (on a cluster of GPUs) than the original TernausNet, while the drop in accuracy is less than 1%. Moreover, we reduced the FLOPs by 84.34% without significantly deteriorating the output performance on the AIRS dataset for TernausNet. The proposed pruning method effectively reduced the number of FLOPs and parameters of the CNN model while almost retaining the original accuracy. The compact model can be deployed on any embedded device without any specialized hardware. We show that the performance of the pruned CNN model is very similar to that of the original unpruned CNN model. We also report numerous ablation studies to validate our approach.
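The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of pruning against an error tolerance rather than a fixed pruning rate. It is not the authors' algorithm. It assumes that each filter carries one scalar importance weight, that filters are tried for removal in order of increasing absolute importance, and that a validation score can be recomputed for any keep-mask. The names `prune_by_tolerance` and `toy_accuracy`, and all numeric values, are hypothetical.

```python
import numpy as np


def prune_by_tolerance(importance, evaluate, tolerance):
    """Greedily drop the least important filters while the score drop
    relative to the unpruned baseline stays within `tolerance`.

    importance: 1-D array, one (assumed) scalar importance weight per filter.
    evaluate:   callable taking a boolean keep-mask and returning a score
                (a stand-in for validation accuracy).
    Returns a boolean keep-mask over the filters.
    """
    keep = np.ones(importance.shape[0], dtype=bool)
    baseline = evaluate(keep)
    # Visit filters from smallest to largest absolute importance.
    for idx in np.argsort(np.abs(importance)):
        trial = keep.copy()
        trial[idx] = False
        if not trial.any():
            break  # never remove the last remaining filter
        if baseline - evaluate(trial) <= tolerance:
            keep = trial  # removal is within the tolerated error
        else:
            break  # stop at the first removal that exceeds the tolerance
    return keep


# Hypothetical importance weights for a layer with six filters.
importance = np.array([0.9, 0.01, 0.5, 0.02, 0.7, 0.005])


def toy_accuracy(mask):
    # Toy proxy, NOT a real validation run: the score falls in proportion
    # to the total importance of the removed filters.
    return 0.95 - 0.1 * np.abs(importance[~mask]).sum()


keep = prune_by_tolerance(importance, toy_accuracy, tolerance=0.01)
pruned_fraction = 1.0 - keep.mean()
print(keep, pruned_fraction)
```

In this toy setting the number of removed filters is an outcome of the tolerance, not an input, which is the contrast with rate-based pruning that the abstract draws. In practice `evaluate` would run the pruned network on held-out data, and the pruned model would typically be fine-tuned afterwards.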

Funder

European Research Consortium for Informatics and Mathematics (ERCIM) fellowship program

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/12/21/11184/pdf

References: 68 articles.

1. A Hyperspectral Image Classification Method Using Multifeature Vectors and Optimized KELM;IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens.,2021

2. Hasan, A.M., and Shin, J. (2022). Online Kanji Characters Based Writer Identification Using Sequential Forward Floating Selection and Support Vector Machine. Appl. Sci., 12.

3. Unsupervised band selection based on weighted information entropy and 3D discrete cosine transform for hyperspectral image classification;Int. J. Remote Sens.,2020

4. Dynamic hybrid mechanism-based differential evolution algorithm and its application;Expert Syst. Appl.,2023

5. Adaptive transfer learning-based multiscale feature fused deep convolutional neural network for EEG MI multiclassification in brain–computer interface;Eng. Appl. Artif. Intell.,2022

Cited by 5 articles.

1. Self-distillation enhanced adaptive pruning of convolutional neural networks;Pattern Recognition;2025-01

2. Channel Pruning of Transfer Learning Models Using Novel Techniques;IEEE Access;2024

3. Targeted Customer Selling Strategy for Electric Vehicles Based on Decision Tree Modeling;Advances in Applied Mathematics;2024

4. An adaptive binary particle swarm optimization for solving multi-objective convolutional filter pruning problem;The Journal of Supercomputing;2023-03-23

5. Application of Improved Process Neural Network Based on the Fireworks Algorithm in the Temperature-Rise Predictions of a Large Generator Rotor;Applied Sciences;2023-02-24