Abstract
We present SensiX++, a multi-tenant runtime for adaptive model execution with integrated MLOps on edge devices, e.g., a camera, a microphone, or IoT sensors. SensiX++ operates on two fundamental principles: highly modular componentisation to externalise data operations with clear abstractions and document-centric manifestation for system-wide orchestration. First, a data coordinator manages the lifecycle of sensors and serves models with correct data through automated transformations. Next, a resource-aware model server executes multiple models in isolation through model abstraction, pipeline automation, and feature sharing. An adaptive scheduler then orchestrates the best-effort executions of multiple models across heterogeneous accelerators, balancing latency and throughput. Finally, microservices with REST APIs serve synthesised model predictions, system statistics, and continuous deployment. Collectively, these components enable SensiX++ to serve multiple models efficiently with fine-grained control on edge devices while minimising data operation redundancy, managing data and device heterogeneity, and reducing resource contention. We benchmark SensiX++ with 10 different vision and acoustics models across various multi-tenant configurations on different edge accelerators (Jetson AGX and Coral TPU) designed for sensory devices. We report on the overall throughput and quantified benefits of various automation components of SensiX++ and demonstrate its efficacy in significantly reducing operational complexity and lowering the effort to deploy, upgrade, reconfigure, and serve embedded models on edge devices.
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference64 articles.
1. 2021. BentoML. (2021). Retrieved August 8 2023 from https://www.bentoml.ai
2. 2021. Coral Keyphrase Detector. (2021). Retrieved August 8 2023 from https://github.com/google-coral/project-keyword-spotter
3. 2021. ElectrifAI. (2021). Retrieved August 8 2023 from https://electrifai.net
4. 2021. Emotion Classification. (2021). Retrieved August 8 2023 from https://github.com/Data-Science-kosta/Speech-Emotion-Classification-with-PyTorch/
5. 2021. KubeFlow. (2021). Retrieved August 8 2023 from https://www.kubeflow.org