Adaptive deep learning model selection on embedded systems-Reference-Cited by-同舟云学术

Adaptive deep learning model selection on embedded systems

Published:2018-12-07 Issue:6 Volume:53 Page:31-43
ISSN:0362-1340
Container-title:ACM SIGPLAN Notices
language:en
Short-container-title:SIGPLAN Not.

Author:

Taylor Ben¹,Marco Vicent Sanz¹,Wolff Willy¹,Elkhatib Yehia¹,Wang Zheng¹

Affiliation:

1. Lancaster University, UK

Abstract

The recent ground-breaking advances in deep learning networks (DNNs) make them attractive for embedded systems. However, it can take a long time for DNNs to make an inference on resource-limited embedded devices. Offloading the computation into the cloud is often infeasible due to privacy concerns, high latency, or the lack of connectivity. As such, there is a critical need to find a way to effectively execute the DNN models locally on the devices. This paper presents an adaptive scheme to determine which DNN model to use for a given input, by considering the desired accuracy and inference time. Our approach employs machine learning to develop a predictive model to quickly select a pre-trained DNN to use for a given input and the optimization constraint. We achieve this by first training off-line a predictive model, and then use the learnt model to select a DNN model to use for new, unseen inputs. We apply our approach to the image classification task and evaluate it on a Jetson TX2 embedded deep learning platform using the ImageNet ILSVRC 2012 validation dataset. We consider a range of influential DNN models. Experimental results show that our approach achieves a 7.52% improvement in inference accuracy, and a 1.8x reduction in inference time over the most-capable single DNN model.

Funder

Engineering and Physical Sciences Research Council

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3299710.3211336

Reference62 articles.

1. JJ Allaire Dirk Eddelbuettel Nick Golding and Yuan Tang. 2016. TensorFlow for R. https://tensorflow.rstudio.com/ JJ Allaire Dirk Eddelbuettel Nick Golding and Yuan Tang. 2016. TensorFlow for R. https://tensorflow.rstudio.com/

2. Dario Amodei et al. 2016. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. In ICML ’16. Dario Amodei et al. 2016. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. In ICML ’16.

3. Dzmitry Bahdanau et al. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014). Dzmitry Bahdanau et al. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).

4. Sparsification and Separation of Deep Learning Layers for Constrained Resource Inference on Wearables

5. Alfredo Canziani Adam Paszke and Eugenio Culurciello. 2016. An Analysis of Deep Neural Network Models for Practical Applications. CoRR (2016). Alfredo Canziani Adam Paszke and Eugenio Culurciello. 2016. An Analysis of Deep Neural Network Models for Practical Applications. CoRR (2016).

Cited by 46 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Edge computing in future wireless networks: A comprehensive evaluation and vision for 6G and beyond;ICT Express;2024-08

2. Multi-Dimensional QoS Evaluation and Optimization of Mobile Edge Computing for IoT: A Survey;Chinese Journal of Electronics;2024-07

3. Adapting Neural Networks at Runtime: Current Trends in At-Runtime Optimizations for Deep Learning;ACM Computing Surveys;2024-05-14

4. Artificial intelligence and edge computing for machine maintenance-review;Artificial Intelligence Review;2024-04-15

5. DiTMoS: Delving into Diverse Tiny-Model Selection on Microcontrollers;2024 IEEE International Conference on Pervasive Computing and Communications (PerCom);2024-03-11