Deep Convolutional Neural Network Compression based on the Intrinsic Dimension of the Training Data-Reference-Cited by-同舟云学术

Deep Convolutional Neural Network Compression based on the Intrinsic Dimension of the Training Data

Published:2024-03 Issue:1 Volume:24 Page:14-23
ISSN:1559-6915
Container-title:ACM SIGAPP Applied Computing Review
language:en
Short-container-title:SIGAPP Appl. Comput. Rev.

Author:

Hadi Abir Mohammad¹,Won Kwanghee¹

Affiliation:

1. South Dakota State University, Brookings, South Dakota, USA

Abstract

Selecting the optimal deep learning architecture for a particular task and dataset remains an ongoing challenge. Typically, this decision-making process involves exhaustive searches for neural network architectures or multi-phase optimization, which includes initial training, compression or pruning, and fine-tuning steps. In this study, we introduce an approach utilizing a deep reinforcement learning-based agent to dynamically compress a deep convolutional neural network throughout its training process. We integrate the concept of the intrinsic dimension of the training data to provide the agent with insights into the task's complexity. The agent employs two distinct ranking criteria, L1-norm-based and attention-based measures, to selectively prune filters from each layer as it determines necessary. In the experiments, we used the CIFAR-10 dataset and its subsets (2-class and 5-class subsets) to model the task complexity and showed that the agent learns different policies depending on the intrinsic dimension. The agent, on average, pruned off 78.48%, 77.9%, and 83.12% filters from all the layers of VGG-16 network for CIFAR-10 full, 5-class, and 2-class subsets respectively.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3663652.3663654

Reference34 articles.

1. Estimating Local Intrinsic Dimensionality

2. Using filter banks in Convolutional Neural Networks for texture classification

3. Structured Pruning of Deep Convolutional Neural Networks

4. Neuroplasticity-Based Pruning Method for Deep Convolutional Neural Networks;Camacho Jose David;Applied Sciences,2022

5. FPC: Filter pruning via the contribution of output feature map for deep convolutional neural networks acceleration