Affiliation:
1. Wuhan University, Wuhan, China
2. Xi’an Jiaotong University, China
3. Hong Kong University of Science and Technology, China
Abstract
Recent years have witnessed an exponential increase in the use of mobile and embedded devices. With the great success of deep learning in many fields, there is an emerging trend to deploy deep learning on mobile and embedded devices to better meet the requirement of real-time applications and user privacy protection. However, the limited resources of mobile and embedded devices make it challenging to fulfill the intensive computation and storage demand of deep learning models. In this survey, we conduct a comprehensive review on the related issues for deep learning on mobile and embedded devices. We start with a brief introduction of deep learning and discuss major challenges of implementing deep learning models on mobile and embedded devices. We then conduct an in-depth survey on important compression and acceleration techniques that help adapt deep learning models to mobile and embedded devices, which we specifically classify as pruning, quantization, model distillation, network design strategies, and low-rank factorization. We elaborate on the hardware-based solutions, including mobile GPU, FPGA, and ASIC, and describe software frameworks for mobile deep learning models, especially the development of frameworks based on OpenCL and RenderScript. After that, we present the application of mobile deep learning in a variety of areas, such as navigation, health, speech recognition, and information security. Finally, we discuss some future directions for deep learning on mobile and embedded devices to inspire further research in this area.
Funder
National Natural Science Foundation of China
Equipment Pre-research Joint Fund
Outstanding Youth Foundation of Hubei Province
Wuhan Advanced Application Project
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science,Theoretical Computer Science
Reference214 articles.
1. Hisilicon. 2018. Hikey 970. Retrieved from https://www.96boards.org/product/hikey970/. Hisilicon. 2018. Hikey 970. Retrieved from https://www.96boards.org/product/hikey970/.
2. Deep Learning with Differential Privacy
3. Fused-layer CNN accelerators
Cited by
71 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献