Where to Prune: Using LSTM to Guide End-to-end Pruning-Reference-Cited by-同舟云学术

Where to Prune: Using LSTM to Guide End-to-end Pruning

Published:2018-07 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Zhong Jing¹,Ding Guiguang¹,Guo Yuchen¹,Han Jungong²,Wang Bin¹

Affiliation:

1. Beijing National Laboratory for Information Science and Technology (BNList), School of Software, Tsinghua University, Beijing 100084, China

2. School of Computing & Communications, Lancaster University, UK

Abstract

Recent years have witnessed the great success of convolutional neural networks (CNNs) in many related fields. However, its huge model size and computation complexity bring in difficulty when deploying CNNs in some scenarios, like embedded system with low computation power. To address this issue, many works have been proposed to prune filters in CNNs to reduce computation. However, they mainly focus on seeking which filters are unimportant in a layer and then prune filters layer by layer or globally. In this paper, we argue that the pruning order is also very significant for model pruning. We propose a novel approach to figure out which layers should be pruned in each step. First, we utilize a long short-term memory (LSTM) to learn the hierarchical characteristics of a network and generate a pruning decision for each layer, which is the main difference from previous works. Next, a channel-based method is adopted to evaluate the importance of filters in a to-be-pruned layer, followed by an accelerated recovery step. Experimental results demonstrate that our approach is capable of reducing 70.1% FLOPs for VGG and 47.5% for Resnet-56 with comparable accuracy. Also, the learning results seem to reveal the sensitivity of each network layer.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comparative Study of Pruning Techniques in Recurrent Neural Networks;Advances in Data-driven Computing and Intelligent Systems;2023

2. LAP: Latency-aware automated pruning with dynamic-based filter selection;Neural Networks;2022-08

3. Revisiting a kNN-Based Image Classification System with High-Capacity Storage;Lecture Notes in Computer Science;2022

4. Using Artificial Neural Network Condensation to Facilitate Adaptation of Machine Learning in Medical Settings by Reducing Computational Burden: Model Design and Evaluation Study;JMIR Formative Research;2021-12-08

5. Deep Learning on Mobile and Embedded Devices;ACM Computing Surveys;2021-07-31