1. Yanjie Gao, Yu Liu, Hongyu Zhang, Zhengxian Li, Yonghao Zhu, Haoxiang Lin, Mao Yang, "Estimating GPU Memory Consumption of Deep Learning Models", 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE), pp. 1342-1352, Nov 2020.
2. Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally, "EIE: Efficient Inference Engine on Compressed Deep Neural Network", IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), vol. 44, no. 3, pp. 243-254, Jun 2016.
3. Song Han, Huizi Mao, William J. Dally, "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding", arXiv:1510.00149 [cs.CV], Feb 2016.
4. Nimit S. Sohoni, Christopher R. Aberger, Megan Leszczynski, Jian Zhang, Christopher Ré, "Low-Memory Neural Network Training: A Technical Report", arXiv:1904.10631 [cs.LG], Apr 2019.
5. Aashaka Shah, Chao-Yuan Wu, Jayashree Mohan, Vijay Chidambaram, Philipp Krähenbühl, "Memory Optimization for Deep Networks", arXiv:2010.14501 [cs.LG], Oct 2020.