A roulette wheel-based pruning method to simplify cumbersome deep neural networks-Reference-Cited by-同舟云学术

A roulette wheel-based pruning method to simplify cumbersome deep neural networks

Published:2024-05-02 Issue:22 Volume:36 Page:13915-13933
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Chan Kit Yan^ORCID,Yiu Ka Fai Cedric,Guo Shan,Jiang Huimin

Abstract

AbstractDeep neural networks (DNNs) have been applied in many pattern recognition or object detection applications. DNNs generally consist of millions or even billions of parameters. These demanding computational storage and requirements impede deployments of DNNs in resource-limited devices, such as mobile devices, micro-controllers. Simplification techniques such as pruning have commonly been used to slim DNN sizes. Pruning approaches generally quantify the importance of each component such as network weight. Weight values or weight gradients in training are commonly used as the importance metric. Small weights are pruned and large weights are kept. However, small weights are possible to be connected with significant weights which have impact to DNN outputs. DNN accuracy can be degraded significantly after the pruning process. This paper proposes a roulette wheel-like pruning algorithm, in order to simplify a trained DNN while keeping the DNN accuracy. The proposed algorithm generates a branch of pruned DNNs which are generated by a roulette wheel operator. Similar to the roulette wheel selection in genetic algorithms, small weights are more likely to be pruned but they can be kept; large weights are more likely to be kept but they can be pruned. The slimmest DNN with the best accuracy is selected from the branch. The performance of the proposed pruning algorithm is evaluated by two deterministic datasets and four non-deterministic datasets. Experimental results show that the proposed pruning algorithm generates simpler DNNs while DNN accuracy can be kept, compared to several existing pruning approaches.

Funder

Curtin University

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00521-024-09719-6.pdf

Reference41 articles.

1. Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444

2. Bao RX, Yuan X, Chen ZK, Ma RX (2018) Cross-entropy pruning for compressing convolutional neural networks. Neural Comput 30(11):3128–3149

3. Liang T, Glossner J, Wanga L, Shi SB, Zhang XT (2021) Pruning and quantization for deep neural network acceleration: a survey. Neurocomputing 461:370–403

4. Alhalabi B, Gaber MM, Basura S (2021) Weights with the smallest magnitude values are set to zero micronets: a multi-phase pruning pipeline to deep ensemble learning in iot devices. Comput Electr Eng 96:107581

5. Pei SW, Wu YS, Guo J, Qiu MK (2022) Neural network pruning by recurrent weights for finance market. ACM Trans Internet Technol 22(3):56