Literature Review of Deep Network Compression

Published:2021-11-17 Issue:4 Volume:8 Page:77
ISSN:2227-9709
Container-title:Informatics
language:en
Short-container-title:Informatics

Author:

Alqahtani Ali, Xie Xianghua, Jones Mark W.

Abstract

Deep networks often possess a vast number of parameters, and their significant redundancy in parameterization has become a widely recognized property. This presents significant challenges and restricts many deep learning applications, shifting the focus to reducing the complexity of models while maintaining their powerful performance. In this paper, we present an overview of popular methods and review recent works on compressing and accelerating deep neural networks. We consider not only pruning methods but also quantization methods and low-rank factorization methods. This review also intends to clarify these major concepts and highlight their characteristics, advantages, and shortcomings.

Publisher

MDPI AG

Subject

Computer Networks and Communications, Human-Computer Interaction, Communication

Link

https://www.mdpi.com/2227-9709/8/4/77/pdf

Reference: 76 articles.

1. Deep Learning;Goodfellow,2016

Cited by 23 articles.

1. Pruning Policy for Image Classification Problems Based on Deep Learning;Informatics;2024-09-12

2. Effect of Post-Training Pruning and Quantization on Endoscopic Computer-Aided Diagnosis Models;2024 IEEE International Symposium on Biomedical Imaging (ISBI);2024-05-27

3. Pruning techniques for artificial intelligence networks: a deeper look at their engineering design and bias: the first review of its kind;Multimedia Tools and Applications;2024-05-10

4. Efficient Bayesian CNN Model Compression using Bayes by Backprop and L1-Norm Regularization;Neural Processing Letters;2024-04-04

5. Containerization in Edge Intelligence: A Review;Electronics;2024-04-02