Cooperative Jamming Resource Allocation with Joint Multi-Domain Information Using Evolutionary Reinforcement Learning

Published:2024-05-29 Issue:11 Volume:16 Page:1955
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Xin Qi¹, Xin Zengxian², Chen Tao¹

Affiliation:

1. College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China

2. Shanghai Radio Equipment Research Institute, Shanghai 201109, China

Abstract

Cooperative jamming of multiple radars by multiple jammers is a difficult allocation problem: it involves spatial discretization, many degrees of freedom, numerous model input parameters, complex constraints, and a multi-peaked objective function. To address these challenges, this paper proposes a cooperative jamming resource allocation method, based on evolutionary reinforcement learning, that uses joint multi-domain information. First, an adversarial scenario model is established that characterizes the interaction between multiple jammers and radars, based on a multi-beam jammer model and a radar detection model. Next, with real-world scenarios in mind, the paper analyzes the constraints and objective function involved in cooperative jamming resource allocation by multiple jammers. Finally, to account for the impact of spatial, frequency, and energy domain information on jamming resource allocation, matrices representing spatial condition constraints, jamming beam allocation, and jamming power allocation are formulated to characterize the cooperative jamming resource allocation problem. On this basis, the joint allocation of jamming beams and jamming power is optimized under the jamming resource constraints. In simulation experiments, compared with the dung beetle optimizer (DBO) algorithm and the particle swarm optimization (PSO) algorithm, the proposed evolutionary reinforcement learning algorithm based on DBO and Q-Learning (DBO-QL) improves the jamming benefit by 3.03% and 6.25% and the optimization success rate by 26.33% and 50.26%, respectively. The proposed hybrid DBO-QL algorithm has a response time of 0.11 s, which is 97.35% and 96.57% lower than the response times of the DBO and PSO algorithms, respectively. The results show that the proposed method has good convergence, stability, and timeliness.
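As a rough illustration of how the three matrices named in the abstract could fit together, the sketch below checks whether a candidate allocation respects a set of resource constraints. The abstract does not give the actual formulation, so everything here is an assumption made for illustration: the function name `is_feasible`, the jammer-by-radar matrix layout, the per-jammer beam limit `max_beams`, and the per-jammer power limit `power_budget` are all hypothetical and are not taken from the paper.

```python
def is_feasible(visible, beams, power, max_beams, power_budget):
    """Check a candidate allocation against illustrative (assumed) constraints.

    All arguments are hypothetical stand-ins, not the paper's notation:
      visible[j][r] : 1 if jammer j can spatially cover radar r, else 0
      beams[j][r]   : 1 if jammer j assigns a jamming beam to radar r, else 0
      power[j][r]   : jamming power that jammer j spends on radar r
      max_beams     : assumed cap on simultaneous beams per jammer
      power_budget  : assumed cap on total power per jammer
    """
    for vis_row, beam_row, pow_row in zip(visible, beams, power):
        # A beam may only point at a radar the jammer can spatially cover.
        if any(b and not v for v, b in zip(vis_row, beam_row)):
            return False
        # Each jammer can form only a limited number of beams.
        if sum(beam_row) > max_beams:
            return False
        # Power must be non-negative and spent only on assigned beams.
        if any(p < 0 or (p > 0 and not b) for b, p in zip(beam_row, pow_row)):
            return False
        # Total power per jammer must stay within its budget
        # (small tolerance for floating-point rounding).
        if sum(pow_row) > power_budget + 1e-12:
            return False
    return True


# Two jammers, three radars: a small hand-made scenario.
visible = [[1, 1, 0], [0, 1, 1]]
beams = [[1, 1, 0], [0, 0, 1]]
power = [[0.6, 0.4, 0.0], [0.0, 0.0, 1.0]]
feasible = is_feasible(visible, beams, power, max_beams=2, power_budget=1.0)
```

A search method such as DBO, PSO, or the hybrid DBO-QL described in the abstract would need some check of this kind to reject or repair infeasible candidates before scoring them. How the paper actually handles infeasible candidates is not stated in the abstract.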

Funder

Shanghai Aerospace Science and Technology Innovation Fund

Publisher

MDPI AG

Link

https://www.mdpi.com/2072-4292/16/11/1955/pdf

References (40 articles)

1. An overview of cognitive radar: Past, present, and future; Gurbuz; IEEE Aerosp. Electron. Syst. Mag., 2019

2. Haykin, S. (2010, January 10–14). New generation of radar systems enabled with cognition. Proceedings of the 2010 IEEE Radar Conference, Arlington, VA, USA.

3. Cognitive radar: A way of the future; Haykin; IEEE Signal Process. Mag., 2006

4. Darpa, A. (2010). Behavioral learning for adaptive electronic warfare. Darpa-BAA-10-79, Defense Advanced Research Projects Agency.

5. DARPA seeks proposals for adaptive radar countermeasures; Haystead; J. Electron. Def., 2012

Cited by 1 article.

1. Distributed Communication Interference Resource Scheduling using the Master-Slave Parallel Scheduling Genetic Algorithm; 2024-08-23