DASS: Differentiable Architecture Search for Sparse Neural Networks-Reference-Cited by-同舟云学术

DASS: Differentiable Architecture Search for Sparse Neural Networks

Published:2023-09-09 Issue:5s Volume:22 Page:1-21
ISSN:1539-9087
Container-title:ACM Transactions on Embedded Computing Systems
language:en
Short-container-title:ACM Trans. Embed. Comput. Syst.

Author:

Mousavi Hamid¹^ORCID,Loni Mohammad¹^ORCID,Alibeigi Mina²^ORCID,Daneshtalab Masoud³^ORCID

Affiliation:

1. School of Innovation, Design and Engineering, Mälardalen University, Sweden

2. Zenseact AB, Lindholmspiren 2, Sweden

3. School of Innovation, Design and Engineering, Mälardalen University,Sweden and Computer systems, Tallinn University of Technology, Estonia

Abstract

The deployment of Deep Neural Networks (DNNs) on edge devices is hindered by the substantial gap between performance requirements and available computational power. While recent research has made significant strides in developing pruning methods to build a sparse network for reducing the computing overhead of DNNs, there remains considerable accuracy loss, especially at high pruning ratios. We find that the architectures designed for dense networks by differentiable architecture search methods are ineffective when pruning mechanisms are applied to them. The main reason is that the current methods do not support sparse architectures in their search space and use a search objective that is made for dense networks and does not focus on sparsity. This paper proposes a new method to search for sparsity-friendly neural architectures. It is done by adding two new sparse operations to the search space and modifying the search objective. We propose two novel parametric SparseConv and SparseLinear operations in order to expand the search space to include sparse operations. In particular, these operations make a flexible search space due to using sparse parametric versions of linear and convolution operations. The proposed search objective lets us train the architecture based on the sparsity of the search space operations. Quantitative analyses demonstrate that architectures found through DASS outperform those used in the state-of-the-art sparse networks on the CIFAR-10 and ImageNet datasets. In terms of performance and hardware effectiveness, DASS increases the accuracy of the sparse version of MobileNet-v2 from 73.44% to 81.35% (+7.91% improvement) with a 3.87× faster inference time.

Funder

European Union through European Social Fund in the frames of the “Information and Communication Technologies (ICT) program”

Swedish Innovation Agency VINNOVA project “AutoDeep”, “SafeDeep”, and “KKS DPAC”

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3609385

Reference92 articles.

1. Faster r-cnn: Towards real-time object detection with region proposal networks;Ren Shaoqing;Advances in Neural Information Processing Systems,2015

2. Deep Residual Learning for Image Recognition

3. Deep learning for computer vision: A brief review;Voulodimos Athanasios;Computational Intelligence and Neuroscience,2018

4. How convolutional neural network see the world-A survey of convolutional neural network visualization methods;Qin Zhuwei;arXiv preprint arXiv:1804.11191,2018

5. AI and memory wall;Gholami Amir;RiseLab Medium Post,2021

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient one-shot Neural Architecture Search with progressive choice freezing evolutionary search;Neurocomputing;2024-09

2. A memristive all-inclusive hypernetwork for parallel analog deployment of full search space architectures;Neural Networks;2024-07

3. OnceNAS: Discovering efficient on-device inference neural networks for edge devices;Information Sciences;2024-05

4. DARTS-PT-CORE: Collaborative and Regularized Perturbation-based Architecture Selection for differentiable NAS;Neurocomputing;2024-05

5. Leveraging Text-to-Text Pretrained Language Models for Question Answering in Chemistry;ACS Omega;2024-03-12