Affiliation:
1. College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China
2. Shanghai Radio Equipment Research Institute, Shanghai 201109, China
Abstract
Addressing the formidable challenges posed by multiple jammers jamming multiple radars, which arise from spatial discretization, many degrees of freedom, numerous model input parameters, and the complexity of constraints, along with a multi-peaked objective function, this paper proposes a cooperative jamming resource allocation method, based on evolutionary reinforcement learning, that uses joint multi-domain information. Firstly, an adversarial scenario model is established, characterizing the interaction between multiple jammers and radars based on a multi-beam jammer model and a radar detection model. Subsequently, considering real-world scenarios, this paper analyzes the constraints and objective function involved in cooperative jamming resource allocation by multiple jammers. Finally, accounting for the impact of spatial, frequency, and energy domain information on jamming resource allocation, matrices representing spatial condition constraints, jamming beam allocation, and jamming power allocation are formulated to characterize the cooperative jamming resource allocation problem. Based on this foundation, the joint allocation of the jamming beam and jamming power is optimized under the constraints of jamming resources. Through simulation experiments, it was determined that, compared to the dung beetle optimizer (DBO) algorithm and the particle swarm optimization (PSO) algorithm, the proposed evolutionary reinforcement learning algorithm based on DBO and Q-Learning (DBO-QL) offers 3.03% and 6.25% improvements in terms of jamming benefit and 26.33% and 50.26% improvements in terms of optimization success rate, respectively. In terms of algorithm response time, the proposed hybrid DBO-QL algorithm has a response time of 0.11 s, which is 97.35% and 96.57% lower than the response times of the DBO and PSO algorithms, respectively. The results show that the method proposed in this paper has good convergence, stability, and timeliness.
Funder
Shanghai Aerospace Science and Technology Innovation Fund
Reference40 articles.
1. An overview of cognitive radar: Past, present, and future;Gurbuz;IEEE Aerosp. Electron. Syst. Mag.,2019
2. Haykin, S. (2010, January 10–14). New generation of radar systems enabled with cognition. Proceedings of the 2010 IEEE Radar Conference, Arlington, VA, USA.
3. Cognitive radar: A way of the future;Haykin;IEEE Signal Process. Mag.,2006
4. Darpa, A. (2010). Behavioral learning for adaptive electronic warfare. Darpa-BAA-10-79, Defense Advanced Research Projects Agency.
5. DARPA seeks proposals for adaptive radar countermeasures;Haystead;J. Electron. Def.,2012
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献