Computation Offloading and Resource Allocation Based on P-DQN in LEO Satellite Edge Networks-Reference-Cited by-同舟云学术

Computation Offloading and Resource Allocation Based on P-DQN in LEO Satellite Edge Networks

Published:2023-12-17 Issue:24 Volume:23 Page:9885
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Yang Xu¹,Fang Hai¹^ORCID,Gao Yuan¹,Wang Xingjie²^ORCID,Wang Kan²,Liu Zheng²

Affiliation:

1. Xi’an Institute of Space Radio Technology, Xi’an 710100, China

2. School of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China

Abstract

Traditional low earth orbit (LEO) satellite networks are typically independent of terrestrial networks, which develop relatively slowly due to the on-board capacity limitation. By integrating emerging mobile edge computing (MEC) with LEO satellite networks to form the business-oriented “end-edge-cloud” multi-level computing architecture, some computing-sensitive tasks can be offloaded by ground terminals to satellites, thereby satisfying more tasks in the network. How to make computation offloading and resource allocation decisions in LEO satellite edge networks, nevertheless, indeed poses challenges in tracking network dynamics and handling sophisticated actions. For the discrete-continuous hybrid action space and time-varying networks, this work aims to use the parameterized deep Q-network (P-DQN) for the joint computation offloading and resource allocation. First, the characteristics of time-varying channels are modeled, and then both communication and computation models under three different offloading decisions are constructed. Second, the constraints on task offloading decisions, on remaining available computing resources, and on the power control of LEO satellites as well as the cloud server are formulated, followed by the maximization problem of satisfied task number over the long run. Third, using the parameterized action Markov decision process (PAMDP) and P-DQN, the joint computing offloading, resource allocation, and power control are made in real time, to accommodate dynamics in LEO satellite edge networks and dispose of the discrete-continuous hybrid action space. Simulation results show that the proposed P-DQN method could approach the optimal control, and outperforms other reinforcement learning (RL) methods for merely either discrete or continuous action space, in terms of the long-term rate of satisfied tasks.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Natural Science Foundation of Shaanxi Province of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/24/9885/pdf

Reference52 articles.

1. LEO satellite constellation for Internet of Things;Qu;IEEE Access,2017

2. Heterogeneous space and terrestrial integrated networks for IoT: Architecture and challenges;Chien;IEEE Netw.,2019

3. System integration of terrestrial mobile communication and satellite communication—The trends, challenges and key technologies in B5G and 6G;Chen;China Commun.,2020

4. Vision, requirements, and technology trend of 6G: How to tackle the challenges of system coverage, capacity, user data-rate and movement speed;Chen;IEEE Wirel. Commun.,2020

5. SFC-based service provisioning for reconfigurable space-air-ground integrated networks;Wang;IEEE J. Sel. Areas Commun.,2020

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep Reinforcement Learning based Mobility Management in a MEC-Enabled Cellular IoT Network;Pervasive and Mobile Computing;2024-09