Cost-Efficient Deep Neural Network Placement in Edge Intelligence-Enabled Internet of Things

Author:

Tian Hao12ORCID,Xu Xiaolong2ORCID,Wu Hongyue3ORCID,Zhao Qingzhan4ORCID,Dai Jianguo4ORCID,Khan Maqbool5ORCID

Affiliation:

1. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu, China, Nanjing, China

2. School of Software, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, China, Nanjing, China

3. College of Intelligence and Computing, Tianjin University, Tianjin, China, Tianjin, China

4. Geospatial Information Engineering Research Center, College of Information Science and Technology, Shihezi University, Shihezi, Xinjiang, China, Shihezi, China

5. Department of IT and Computer Science, Pak-Austria Fachhochschule Institute of Applied Sciences and Technology, Haripur, Pakistan, Haripur, Pakistan

Abstract

Edge intelligence (EI) integrates edge computing and artificial intelligence empowering service providers to deploy deep neural networks (DNNs) on edge servers in proximity to users to provision intelligent applications (e.g., autonomous driving) for ubiquitous Internet of Things (IoT) in smart cities, which facilitates the quality of experience (QoE) of users and improves the processing and energy efficiency. However, considering DNN is typically computational-intensive and resource-hungry, conventional placement approaches ignore the influence of multi-dimensional resource requirements (processor, memory, etc.), which may degrade the real-time performance. Moreover, with the increasing scale of geo-distributed edge servers, centralized decision-making is still challenging to find the optimal strategies effectively. To overcome these shortcomings, in this paper we propose a game theoretic DNN placement approach in EI-enabled IoT. First, a DNN placement optimization problem is formulated to maximize system benefits, which is proven to be \(\mathcal {N}\mathcal {P} \) -hard and model the original problem as an exact potential game (EPG). Moreover, an EP G-based DNN m o del p l acement algorithm, named EPOL, is designed for edge servers to make sub-optimal strategies independently and theoretical analysis is possessed to guarantee the performance of EPOL. Finally, real-world dataset based experimental results corroborate the superiority and effectiveness of EPOL.

Publisher

Association for Computing Machinery (ACM)

Reference47 articles.

1. Minimum Overhead Beamforming and Resource Allocation in D2D Edge Networks

2. Federated distillation and blockchain empowered secure knowledge sharing for Internet of medical Things

3. Yuanming Ren, Shihao Shen, Yanli Ju, Xiaofei Wang, Wenyu Wang, and Victor CM Leung. 2022. EdgeMatrix: A Resources Redefined Edge-Cloud System for Prioritized Services. In IEEE INFOCOM 2022-IEEE Conference on Computer Communications. IEEE, 610–619.

4. A Survey on Vision Transformer

5. Edge Computing with Artificial Intelligence: A Machine Learning Perspective

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3