AMSPM: Adaptive Model Selection and Partition Mechanism for Edge Intelligence-driven 5G Smart City with Dynamic Computing Resources-Reference-Cited by-同舟云学术

AMSPM: Adaptive Model Selection and Partition Mechanism for Edge Intelligence-driven 5G Smart City with Dynamic Computing Resources

Published:2024-03-16 Issue: Volume: Page:
ISSN:1550-4859
Container-title:ACM Transactions on Sensor Networks
language:en
Short-container-title:ACM Trans. Sen. Netw.

Author:

Niu Xin¹,Cao Xuejiao¹,Yu Chen¹,Jin Hai¹

Affiliation:

1. National Engineering Research Center for Big Data Technology and System, Services Computing Technology and System Lab, Cluster and Grid Computing Lab, School of Computer Science and Technology Huazhong University of Science and Technology, Wuhan, China

Abstract

With the help of 5G network, edge intelligence (EI) can not only provide distributed, low-latency, and high-reliable intelligent services, but also enable intelligent maintenance and management of smart city. However, the constantly changing available computing resources of end devices and edge servers cannot continuously guarantee the performance of intelligent inference. In order to guarantee the sustainability of intelligent services in smart city, we propose the Adaptive Model Selection and Partition Mechanism (AMSPM) in 5G smart city where EI provides services, which mainly consists of Adaptive Model Selection (AMS) and Adaptive Model Partition (AMP). In AMSPM, the model selection and partition of deep neural network (DNN) are formulated as an optimization problem. Firstly, we propose a recursive-based algorithm named AMS based on the computing resources of edge devices to derive an appropriate DNN model that satisfies the latency demand of intelligent services. Then, we adaptively partition the selected DNN model according to the computing resources of edge devices. The experimental results demonstrate that, when compared with state-of-the-art model selection and partition mechanisms, AMSPM not only reduces latency but also enhances computing resource utilization.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3652516

Reference41 articles.

1. Mahbuba Afrin, Jiong Jin, Akhlaqur Rahman, Ashfaqur Rahman, Jiafu Wan, and Ekram Hossain. 2021. Resource allocation and service provisioning in multi-agent cloud robotics: A comprehensive survey. IEEE Communications Surveys \(\& \) Tutorials 23, 2 (2021), 842–870.

2. Chanho Ahn, Eunwoo Kim, and Songhwai Oh. 2019. Deep elastic networks with model selection for multi-task learning. In Proceedings of the IEEE/CVF international conference on computer vision. 6529–6538.

3. Cost-effective ensemble models selection using deep reinforcement learning

4. Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, and Song Han. 2019. Once-for-All: Train One Network and Specialize it for Efficient Deployment on Diverse Hardware Platforms. In International Conference on Learning Representations.

5. Revisiting computation partitioning in future 5G-based edge computing environments;Cao Jin;IEEE Internet of Things Journal,2018