A Multi-Branch DQN-Based Transponder Resource Allocation Approach for Satellite Communications

Published: 2023-02-11 Volume: 12 Issue: 4 Page: 916
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Sun Wenyu¹, Zhang Weijia¹, Ma Ning¹, Jia Min²

Affiliation:

1. The 54th Research Institute of China Electronics Technology Group Corporation, Shijiazhuang 050081, China

2. School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin 150080, China

Abstract

As frequency spectrum resources grow increasingly scarce in satellite communication systems based on transparent transponders, fast and efficient resource allocation algorithms have become key to improving overall resource occupancy. In this paper, we propose a reinforcement learning-based Multi-Branch Deep Q-Network (MBDQN), which introduces a TL-Branch and an RP-Branch to extract features of the satellite resource pool state and the task state simultaneously, and a Value-Branch to calculate the action-value function. First, MBDQN improves the average resource occupancy performance (AOP) by selecting among multiple actions, including task selection and resource priority actions. Second, the trained MBDQN is well suited to online deployment and significantly reduces runtime overhead, because it requires no iteration in the test phase. Experiments on generated task datasets, both non-zero-waste and zero-waste, show that the proposed method outperforms greedy and heuristic methods.
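The multi-branch structure described in the abstract can be illustrated with a minimal, untrained sketch. It is not the authors' implementation. It assumes that the TL-Branch reads the task state and the RP-Branch reads the resource pool state (the abstract does not spell out this mapping), that each branch is a single dense layer, and that the Value-Branch scores every joint (task, resource priority) action. All class names, dimensions, and layer sizes below are hypothetical.

```python
import random


def make_linear(n_in, n_out, rng):
    """Random weights and zero biases for one dense layer (untrained placeholder)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def linear(layer, x):
    """Apply a dense layer to the input vector x."""
    weights, biases = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, biases)]


def relu(x):
    return [max(0.0, v) for v in x]


class MultiBranchQNet:
    """Hypothetical three-branch Q-network: two feature branches, one value branch."""

    def __init__(self, task_dim, pool_dim, hidden, n_tasks, n_priorities, seed=0):
        rng = random.Random(seed)
        self.n_priorities = n_priorities
        # Assumption: TL-Branch encodes the task state, RP-Branch the resource pool state.
        self.tl_branch = make_linear(task_dim, hidden, rng)
        self.rp_branch = make_linear(pool_dim, hidden, rng)
        # Assumption: the Value-Branch outputs one Q-value per (task, priority) pair.
        self.value_branch = make_linear(2 * hidden, n_tasks * n_priorities, rng)

    def q_values(self, task_state, pool_state):
        task_feat = relu(linear(self.tl_branch, task_state))
        pool_feat = relu(linear(self.rp_branch, pool_state))
        # Concatenate both feature vectors before the value branch.
        return linear(self.value_branch, task_feat + pool_feat)

    def greedy_action(self, task_state, pool_state):
        """One forward pass plus an argmax; no iterative search at test time."""
        q = self.q_values(task_state, pool_state)
        best = max(range(len(q)), key=q.__getitem__)
        return divmod(best, self.n_priorities)  # (task index, priority index)


# Usage with made-up state vectors.
net = MultiBranchQNet(task_dim=6, pool_dim=8, hidden=16, n_tasks=3, n_priorities=2)
task_state = [0.5] * 6
pool_state = [1.0] * 8
q = net.q_values(task_state, pool_state)
task_idx, priority_idx = net.greedy_action(task_state, pool_state)
```

Choosing an action here costs a single forward pass followed by an argmax, which is one plausible reading of the abstract's claim that the trained network needs no iteration in the test phase.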

Funder

National Natural Science Foundation of China

Natural Science Foundation for Outstanding Young Scholars of Heilongjiang Province

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/4/916/pdf

Reference: 23 articles.

1. Jia (2020). Intelligent resource management for satellite and terrestrial spectrum shared networking toward B5G. IEEE Wirel. Commun.

2. Yanlei, D., Chunting, W., Chenhua, S., Yusheng, L., and Qing, X. (2018, January 6–9). Performance Evaluation for Satellite Communication Networks Based on AHP-BP Algorithm. Proceedings of the 2018 10th International Conference on Communication Software and Networks (ICCSN), Chengdu, China.

3. Bai, Y., Liang, C., and Chen, Q. (2022, January 19–21). Network Slice Admission Control and Resource Allocation in LEO Satellite Networks: A Robust Optimization Approach. Proceedings of the 2022 27th Asia Pacific Conference on Communications (APCC), Jeju Island, Republic of Korea.

4. Guo (2017). Application of constraint-based satellite mission planning model in forest fire monitoring. AIP Conf. Proc.

5. Lin, Z., An, K., Niu, H., Hu, Y., Chatzinotas, S., Zheng, G., and Wang, J. (2022). SLNR-based secure energy efficient beamforming in Multibeam Satellite Systems. IEEE Trans. Aerosp. Electron. Syst., early access.