Deep Learning on Mobile and Embedded Devices-Reference-Cited by-同舟云学术

Deep Learning on Mobile and Embedded Devices

Published:2021-07-31 Issue:4 Volume:53 Page:1-37
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Chen Yanjiao¹^ORCID,Zheng Baolin¹,Zhang Zihan¹,Wang Qian¹,Shen Chao²^ORCID,Zhang Qian³

Affiliation:

1. Wuhan University, Wuhan, China

2. Xi’an Jiaotong University, China

3. Hong Kong University of Science and Technology, China

Abstract

Recent years have witnessed an exponential increase in the use of mobile and embedded devices. With the great success of deep learning in many fields, there is an emerging trend to deploy deep learning on mobile and embedded devices to better meet the requirement of real-time applications and user privacy protection. However, the limited resources of mobile and embedded devices make it challenging to fulfill the intensive computation and storage demand of deep learning models. In this survey, we conduct a comprehensive review on the related issues for deep learning on mobile and embedded devices. We start with a brief introduction of deep learning and discuss major challenges of implementing deep learning models on mobile and embedded devices. We then conduct an in-depth survey on important compression and acceleration techniques that help adapt deep learning models to mobile and embedded devices, which we specifically classify as pruning, quantization, model distillation, network design strategies, and low-rank factorization. We elaborate on the hardware-based solutions, including mobile GPU, FPGA, and ASIC, and describe software frameworks for mobile deep learning models, especially the development of frameworks based on OpenCL and RenderScript. After that, we present the application of mobile deep learning in a variety of areas, such as navigation, health, speech recognition, and information security. Finally, we discuss some future directions for deep learning on mobile and embedded devices to inspire further research in this area.

Funder

National Natural Science Foundation of China

Equipment Pre-research Joint Fund

Outstanding Youth Foundation of Hubei Province

Wuhan Advanced Application Project

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3398209

Reference214 articles.

1. Hisilicon. 2018. Hikey 970. Retrieved from https://www.96boards.org/product/hikey970/. Hisilicon. 2018. Hikey 970. Retrieved from https://www.96boards.org/product/hikey970/.

2. Deep Learning with Differential Privacy

3. Fused-layer CNN accelerators

Cited by 71 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Rapid and portable quantification of HIV RNA via a smartphone-enabled digital CRISPR device and deep learning;Sensors and Actuators Reports;2024-12

2. Automatic motion artifact detection in electrodermal activity signals using 1D U-net architecture;Computers in Biology and Medicine;2024-11

3. A comprehensive review of model compression techniques in machine learning;Applied Intelligence;2024-09-02

4. ReCTSi: Resource-efficient Correlated Time Series Imputation via Decoupled Pattern Learning and Completeness-aware Attentions;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

5. Assessing the Efficacy of TinyML Implementations on STM32 Microcontrollers: A Performance Evaluation Study;2024 IEEE 7th International Conference on Advanced Technologies, Signal and Image Processing (ATSIP);2024-07-11