Reinforcement Learning-Based Resource Allocation and Energy Efficiency Optimization for a Space–Air–Ground-Integrated Network

Published:2024-05-06 Issue:9 Volume:13 Page:1792
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Chen Zhiyu¹, Zhou Hongxi¹, Du Siyuan², Liu Jiayan², Zhang Luyang², Liu Qi³

Affiliation:

1. State Grid Information & Telecommunication Branch, Beijing 100761, China

2. School of Electrical and Electronic Engineering, North China Electric Power University, Beijing 102206, China

3. Beijing Fibrlink Communications Co., Ltd., Beijing 100071, China

Abstract

With the construction and development of the smart grid, power services place higher demands on the communication capability of the network. To improve the energy efficiency of the space–air–ground-integrated power three-dimensional fusion communication network, we formulate an optimization problem for joint air platform (AP) flight path selection, ground power facility (GPF) association, and power control. We decompose the problem into two subproblems: the AP flight path selection subproblem, and the GPF association and power control subproblem. First, based on the GPF distribution and throughput weights, we model the AP flight path selection subproblem as a Markov Decision Process (MDP) and propose a multi-agent iterative optimization algorithm based on a comprehensive judgment of GPF positions and workload. Second, we model the GPF association and power control subproblem as a multi-agent, time-varying K-armed bandit model and propose an algorithm based on multi-agent Temporal Difference (TD) learning. Then, by alternately iterating between the two subproblems, we propose a reinforcement learning (RL)-based joint optimization algorithm. Finally, simulation results indicate that, compared to three baseline algorithms (random path, average transmit power, and random device association), the proposed algorithm improves the overall energy efficiency of the system by 16.23%, 86.29%, and 5.11%, respectively, under various conditions (including different noise power levels, GPF bandwidths, and GPF quantities).
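The K-armed bandit view of power control mentioned in the abstract can be illustrated with a minimal single-agent sketch. This is not the authors' algorithm: the reward model (Shannon rate divided by transmit plus circuit power), the discrete power levels, the normalized channel parameters, and the names `energy_efficiency`, `BanditAgent`, and `run` are all assumptions made for illustration. The constant step size `alpha` is the usual choice when rewards may drift over time, since it weights recent observations more heavily.

```python
import math
import random


def energy_efficiency(power_w, gain, noise_w, bandwidth_hz, circuit_w=0.2):
    """Assumed reward: Shannon rate divided by total consumed power."""
    rate = bandwidth_hz * math.log2(1.0 + power_w * gain / noise_w)
    return rate / (power_w + circuit_w)


class BanditAgent:
    """Epsilon-greedy K-armed bandit with a constant step-size update."""

    def __init__(self, n_arms, alpha=0.1, epsilon=0.1, seed=0):
        self.q = [0.0] * n_arms          # one value estimate per arm
        self.alpha = alpha               # constant step size (tracks drift)
        self.epsilon = epsilon           # exploration probability
        self.rng = random.Random(seed)   # private RNG for reproducibility

    def select(self):
        # Explore uniformly with probability epsilon, otherwise act greedily.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.q))
        return max(range(len(self.q)), key=lambda a: self.q[a])

    def update(self, arm, reward):
        # Move the estimate a fraction alpha toward the observed reward.
        self.q[arm] += self.alpha * (reward - self.q[arm])


def run(steps=5000, seed=0):
    # Hypothetical discrete transmit power levels (arms), in watts.
    power_levels = [0.05, 0.1, 0.2, 0.5, 1.0]
    agent = BanditAgent(len(power_levels), seed=seed)
    for _ in range(steps):
        arm = agent.select()
        # Normalized, illustrative channel: gain 1.0, noise 0.01, bandwidth 1.0.
        reward = energy_efficiency(power_levels[arm], 1.0, 0.01, 1.0)
        agent.update(arm, reward)
    return power_levels, agent.q


if __name__ == "__main__":
    levels, q = run()
    best = max(range(len(q)), key=lambda a: q[a])
    print("estimated best power level:", levels[best])
```

In a multi-agent setting such as the one the abstract describes, each GPF would hold its own agent and the reward of one agent would depend on the choices of the others; that coupling is omitted here.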

Funder

Science and Technology Foundation of the State Grid Corporation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/9/1792/pdf
