Allocation method of communication interference resource based on deep reinforcement learning of maximum policy entropy-Reference-Cited by-同舟云学术

Allocation method of communication interference resource based on deep reinforcement learning of maximum policy entropy

Published:2021-10 Issue:5 Volume:39 Page:1077-1086
ISSN:1000-2758
Container-title:Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
language:
Short-container-title:西北工业大学学报

Author:

RAO Ning,XU Hua,QI Zisen,SONG Bailin,SHI Yunhao

Abstract

In order to solve the optimization of the interference resource allocation in communication network countermeasures, an interference resource allocation method based on the maximum policy entropy deep reinforcement learning (MPEDRL) was proposed. The method introduced the idea of deep reinforcement learning into the communication countermeasures resource allocation, it could enhance the exploration of the policy and accelerate the convergence to the global optimum with adding the maximum policy entropy criterion and adaptively adjusting the entropy coefficient. The method modeled interference resource allocation as Markov decision process, then established the interference strategy network to output allocation scheme, constructing the interference effect evaluation network of the clipped twin structure for efficiency evaluation, and trained the policy network and the evaluation network with the goal of maximizing the strategy entropy and the cumulative interference efficacy, then decided the optimal interference resource allocation scheme. The simulation results show that the algorithm can effectively solve the resource allocation problem in communication network confrontation, comparing with the existing deep reinforcement learning methods, it has faster learning speed and less fluctuation in the training process, and achieved 15% higher jamming efficacy than DDPG-based method.

Publisher

EDP Sciences

Subject

General Engineering

Link

https://www.jnwpu.org/10.1051/jnwpu/20213951077/pdf

Reference24 articles.

1. Frequency Hopping Sequences With Optimal Partial Hamming Correlation

2. Wang X J, Lei M J, Zhao M J, et al. Cooperative anti-jamming strategy and outage probability optimization for multi-hop ad-hoc networks[C]//2017 IEEE 86th Vehicular Technology Conference, 2017: 24–27

3. Sun J, Li X. Carrier frequency offset synchronization algorithm for short burst communication system[C]//Proceedings of 2016 IEEE 13th International Conference on Signal Processing, 2016: 6–10

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Carrier Adjustment Algorithm for Mobile Communication Network Based on Deep Reinforcement Learning;Lecture Notes in Networks and Systems;2024

2. Adaptive Optimization Design of Building Energy System for Smart Elderly Care Community Based on Deep Deterministic Policy Gradient;Processes;2023-07-19

3. A Deep Reinforcement Learning Communication Jamming Resource Allocation Algorithm Fused with Noise Network;J ELECTRON INF TECHN;2023

4. A DRL-Based Intelligent Jamming Approach for Joint Channel and Power Optimization;Wireless Communications and Mobile Computing;2023-02-06

5. A hierarchical comb interference resource allocation algorithm based on greedy strategy and evolutionary algorithm;2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP);2022-04-15