Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

Published:2024-02-22 Issue:1-2 Volume:52 Page:40-58
ISSN:0885-7458
Container-title:International Journal of Parallel Programming
language:en
Short-container-title:Int J Parallel Prog

Author:

Heidorn Christian, Sabih Muhammad, Meyerhöfer Nicolai, Schinabeck Christian, Teich Jürgen, Hannig Frank

Abstract

Filter pruning of convolutional neural networks (CNNs) is a common technique to effectively reduce the memory footprint, the number of arithmetic operations, and, consequently, inference time. Recent pruning approaches also consider the targeted device (i.e., graphics processing units) for CNN deployment to reduce the actual inference time. However, simple metrics, such as the $\ell^1$-norm, are used for deciding which filters to prune. In this work, we propose a hardware-aware technique to explore the vast multi-objective design space of possible filter pruning configurations. Our approach incorporates not only the targeted device but also techniques from explainable artificial intelligence for ranking and deciding which filters to prune. For each layer, the number of filters to be pruned is optimized with the objective of minimizing the inference time and the error rate of the CNN. Experimental results show that our approach can speed up inference time by 1.40× and 1.30× for VGG-16 on the CIFAR-10 dataset and ResNet-18 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.
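As a rough illustration of two ideas the abstract mentions, the sketch below ranks the filters of one layer by their $\ell^1$-norm (the simple baseline metric the abstract contrasts against, not the explainable-AI ranking the paper proposes) and keeps the non-dominated trade-offs between inference time and error rate. All function names, the toy weights, and the candidate measurements are hypothetical and chosen only for this example; the paper's actual search procedure is not reproduced here.

```python
from typing import List, Sequence, Tuple


def l1_norm(filter_weights) -> float:
    """Sum of absolute values of all weights in one (arbitrarily nested) filter."""
    if isinstance(filter_weights, (int, float)):
        return abs(float(filter_weights))
    return sum(l1_norm(w) for w in filter_weights)


def rank_filters_by_l1(layer: Sequence) -> List[int]:
    """Return filter indices sorted from smallest to largest l1-norm."""
    return sorted(range(len(layer)), key=lambda i: l1_norm(layer[i]))


def pareto_front(points: Sequence[Tuple[float, float]]) -> List[Tuple[float, float]]:
    """Keep the (inference_time, error_rate) pairs that no other pair dominates.

    Both objectives are minimized: q dominates p if q is no worse in both
    objectives and strictly better in at least one.
    """
    front = []
    for p in points:
        dominated = any(
            q[0] <= p[0] and q[1] <= p[1] and (q[0] < p[0] or q[1] < p[1])
            for q in points
        )
        if not dominated:
            front.append(p)
    return front


# Toy layer with three single-channel 2x2 filters (made-up numbers).
layer = [
    [[[0.5, -0.5], [1.0, 1.0]]],   # l1-norm 3.0
    [[[0.1, 0.0], [-0.1, 0.2]]],   # l1-norm 0.4
    [[[2.0, -1.0], [0.0, 0.5]]],   # l1-norm 3.5
]
order = rank_filters_by_l1(layer)
to_prune = order[:1]  # prune the single filter with the smallest norm

# Hypothetical (inference time in ms, error rate) results of four pruning configurations.
candidates = [(10.0, 0.08), (8.0, 0.09), (9.0, 0.10), (12.0, 0.07)]
front = pareto_front(candidates)
```

Here the configuration `(9.0, 0.10)` is dropped because `(8.0, 0.09)` is both faster and more accurate; the remaining three configurations are mutually incomparable trade-offs.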

Funder

Bundesministerium für Bildung und Forschung

Friedrich-Alexander-Universität Erlangen-Nürnberg

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10766-024-00760-5.pdf

References (31 articles)

1. Hoefler, T., et al.: Sparsity in deep learning: pruning and growth for efficient inference and training in neural networks. J. Mach. Learn. Res. 22, 241:1-241:124 (2021)

2. Zhang, Y., et al.: Improvement of efficiency in evolutionary pruning. In: Proceedings of the International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2021). https://doi.org/10.1109/IJCNN52387.2021.9534055

3. Lin, M., et al.: Channel pruning via automatic structure search. In: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI), pp. 673–679 (2020). https://doi.org/10.24963/ijcai.2020/94

4. Zhou, Y., Yen, G.G., Yi, Z.: A knee-guided evolutionary algorithm for compressing deep neural networks. IEEE Trans. Cybern. 51(3), 1626–1638 (2021). https://doi.org/10.1109/TCYB.2019.2928174

5. Heidorn, C., et al.: Hardware-aware evolutionary filter pruning. In: Embedded Computer Systems: Architectures, Modeling, and Simulation—22nd International Conference, SAMOS 2022, Samos, Greece, July 3–7, 2022, Proceedings, vol. 13511. Lecture Notes in Computer Science, pp. 283–299. Springer (2022). https://doi.org/10.1007/978-3-031-15074-6_18

Cited by 1 article.

1. OpTC – A Toolchain for Deployment of Neural Networks on AURIX TC3xx Microcontrollers;Proceedings;2024