On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures-Reference-Cited by-同舟云学术

On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures

Published:2021-03-15 Issue:6 Volume:10 Page:689
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Springer Tom,Eiroa-Lledo Elia^ORCID,Stevens Elizabeth,Linstead Erik^ORCID

Abstract

As machine learning becomes ubiquitous, the need to deploy models on real-time, embedded systems will become increasingly critical. This is especially true for deep learning solutions, whose large models pose interesting challenges for target architectures at the “edge” that are resource-constrained. The realization of machine learning, and deep learning, is being driven by the availability of specialized hardware, such as system-on-chip solutions, which provide some alleviation of constraints. Equally important, however, are the operating systems that run on this hardware, and specifically the ability to leverage commercial real-time operating systems which, unlike general purpose operating systems such as Linux, can provide the low-latency, deterministic execution required for embedded, and potentially safety-critical, applications at the edge. Despite this, studies considering the integration of real-time operating systems, specialized hardware, and machine learning/deep learning algorithms remain limited. In particular, better mechanisms for real-time scheduling in the context of machine learning applications will prove to be critical as these technologies move to the edge. In order to address some of these challenges, we present a resource management framework designed to provide a dynamic on-device approach to the allocation and scheduling of limited resources in a real-time processing environment. These types of mechanisms are necessary to support the deterministic behavior required by the control components contained in the edge nodes. To validate the effectiveness of our approach, we applied rigorous schedulability analysis to a large set of randomly generated simulated task sets and then verified the most time critical applications, such as the control tasks which maintained low-latency deterministic behavior even during off-nominal conditions. The practicality of our scheduling framework was demonstrated by integrating it into a commercial real-time operating system (VxWorks) then running a typical deep learning image processing application to perform simple object detection. The results indicate that our proposed resource management framework can be leveraged to facilitate integration of machine learning algorithms with real-time operating systems and embedded platforms, including widely-used, industry-standard real-time operating systems.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/6/689/pdf

Reference36 articles.

1. On-Device Machine Learning: An Algorithms and Learning Theory Perspective;Dhar;arXiv,2019

2. Efficient Processing of Deep Neural Networks: A Tutorial and Survey

3. Bring Deep-Learning Inference to Embedded Applicationshttps://www.electronicdesign.com/industrial-automation/article/21808380/bring-deeplearning-inference-to-embedded-applications

4. The Memory Challenge in Ultra-Low Power Deep Learning;Conti,2020

5. On-Chip Error-Triggered Learning of Multi-Layer Memristive Spiking Neural Networks

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Semantic Segmentation Network Slimming and Edge Deployment for Real-Time Forest Fire or Flood Monitoring Systems Using Unmanned Aerial Vehicles;Electronics;2023-11-27

2. Deep Neural Network: An Alternative to Traditional Channel Estimators in Massive MIMO Systems;IEEE Transactions on Cognitive Communications and Networking;2022-06