A Cognitive Electronic Jamming Decision-Making Method Based on Q-Learning and Ant Colony Fusion Algorithm

Published:2023-06-14 Issue:12 Volume:15 Page:3108
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Zhang Chudi¹, Song Yunqi¹, Jiang Rundong¹, Hu Jun¹, Xu Shiyou¹

Affiliation:

1. School of Electronics and Communication Engineering, Sun Yat-sen University, Shenzhen 528406, China

Abstract

To improve the efficiency and adaptability of cognitive radar jamming decision-making, this paper proposes a fusion algorithm (Ant-QL) based on the ant colony algorithm and Q-Learning. The algorithm does not rely on a priori information and gains adaptability through real-time interaction between the jammer and the target radar. It applies to both single-jammer and multiple-jammer countermeasure scenarios and achieves a high jamming effect in each. First, the traditional Q-Learning and DQN algorithms are discussed, and a radar jamming decision-making model is built for the simulation verification of each algorithm. Then, an improved Q-Learning algorithm is proposed to address the shortcomings of both. By introducing the pheromone mechanism of ant colony algorithms into Q-Learning and using the ε-greedy strategy to balance exploration and exploitation, the algorithm largely avoids falling into local optima, which accelerates convergence while keeping the convergence process stable and robust. To better suit the cluster countermeasure environment of future battlefields, the algorithm and model are extended to cluster cooperative jamming decision-making: each jammer in the cluster is mapped to an intelligent ant searching for the optimal path, and the jammers exchange information with one another. During the confrontation, the method greatly improves convergence speed and stability and reduces the jammer's hardware and power requirements. With three jammers, simulation results show that the convergence speed of the Ant-QL algorithm improves by 85.4%, 80.56% and 72% relative to the Q-Learning, DQN and improved Q-Learning algorithms, respectively. During convergence, the Ant-QL algorithm is stable and efficient, and its complexity is low. After convergence, the average response times of the four algorithms (Q-Learning, DQN, improved Q-Learning and Ant-QL) are 6.99 × 10⁻⁴ s, 2.234 × 10⁻³ s, 2.21 × 10⁻⁴ s and 1.7 × 10⁻⁴ s, respectively, so the improved Q-Learning and Ant-QL algorithms also hold an advantage in average response time.
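The abstract describes the core idea only in words: tabular Q-Learning whose action choice is biased by an ant-colony-style pheromone table, with ε-greedy exploration. The sketch below is one possible reading of that idea for a single jammer and is not the paper's implementation. The toy environment `toy_radar_step`, the pheromone rule (evaporate by `rho`, deposit the positive reward), the weighting `beta`, and all parameter values are assumptions made purely for illustration.

```python
import random


def toy_radar_step(state, action):
    """Hypothetical environment (an assumption, not from the paper).

    The radar has threat levels 0..2 plus a terminal 'jammed' state 3.
    Jamming action `a` is effective only when a == state.
    """
    if action == state:
        next_state = state + 1
        done = next_state == 3
        return next_state, (1.0 if done else 0.2), done
    return state, -0.1, False  # ineffective jamming: radar state unchanged


def train(step, n_states=4, n_actions=3, episodes=200, max_steps=50,
          alpha=0.1, gamma=0.9, epsilon=0.1, rho=0.1, beta=0.5, seed=0):
    """Tabular Q-Learning with an assumed pheromone bonus on action scores."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]    # Q-table
    tau = [[0.0] * n_actions for _ in range(n_states)]  # pheromone table
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                # Exploration: pick a jamming action uniformly at random.
                action = rng.randrange(n_actions)
            else:
                # Exploitation: Q-value plus weighted pheromone (assumed form).
                scores = [q[state][a] + beta * tau[state][a]
                          for a in range(n_actions)]
                action = scores.index(max(scores))
            next_state, reward, done = step(state, action)
            # Standard Q-Learning temporal-difference update.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            # Pheromone: evaporate in this state, then deposit on success.
            tau[state] = [(1.0 - rho) * t for t in tau[state]]
            tau[state][action] += max(reward, 0.0)
            state = next_state
            if done:
                break
    return q, tau


def greedy_policy(q, n_decision_states=3):
    """Best-valued action for each non-terminal state."""
    return [row.index(max(row)) for row in q[:n_decision_states]]


if __name__ == "__main__":
    q_table, pheromone = train(toy_radar_step)
    print(greedy_policy(q_table))
```

In this toy setting the pheromone term only reinforces actions that have already produced positive feedback, so it mainly speeds up exploitation. The cooperative multi-jammer case mentioned in the abstract would additionally require the jammers to share information, for example through a common pheromone table, which this single-agent sketch does not attempt.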

Funder

National Key R&D Program of China

Shenzhen Fundamental Research Program

Shenzhen Science and Technology Program

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/12/3108/pdf

References: 59 articles.

1. Haigh, K., and Andrusenko, J. (2021). Cognitive Electronic Warfare: An Artificial Intelligence Approach, Artech House.

2. Haykin, S. (2010, January 10–14). New generation of radar systems enabled with cognition. Proceedings of the 2010 IEEE Radar Conference, Arlington, VA, USA.

3. Haykin, S. (2006). Cognitive radar: A way of the future. IEEE Signal Process. Mag.

4. Darpa, A. (2010). Behavioral Learning for Adaptive Electronic Warfare, Defense Advanced Research Projects Agency. Darpa-BAA-10-79.

5. Haystead, J. (2012). DARPA seeks proposals for adaptive radar countermeasures. J. Electron. Def.

Cited by 5 articles.

1. Multi‐agent multi‐dimensional joint optimisation of jamming decision‐making against multi‐functional radar;IET Radar, Sonar & Navigation;2024-09-02

2. Efficient Jamming Policy Generation Method Based on Multi-Timescale Ensemble Q-Learning;Remote Sensing;2024-08-27

3. Cooperative Jamming Resource Allocation with Joint Multi-Domain Information Using Evolutionary Reinforcement Learning;Remote Sensing;2024-05-29

4. Radar-Jamming Decision-Making Based on Improved Q-Learning and FPGA Hardware Implementation;Remote Sensing;2024-03-28

5. Distributed Jamming Method Based on Spatial Superposition Effect: A Review;2023 3rd International Conference on Electronic Information Engineering and Computer Science (EIECS);2023-09-22