Affiliation:
1. Aviation Engineering College, Air Force Engineering University, Xi’an 710038, China
Abstract
Modern air combat is shaped by many complex factors, which makes it difficult for an unmanned combat aerial vehicle (UCAV) to perceive situation information quickly and accurately and to make maneuvering decisions autonomously. To address this problem, this paper proposes a UCAV maneuvering decision algorithm that combines deep reinforcement learning with game theory. First, an air combat situation assessment model and an advantage reward function are established on the basis of the UCAV dynamics model and a maneuver library, and sample data for the situation assessment indicators are constructed using the structure entropy weight method. Second, a convolutional neural network (CNN) processes the high-dimensional continuous situation features of the UCAV in air combat, removing correlation and redundancy between the features, and is trained to approximate the action-value function. Third, the double deep Q network (DDQN) algorithm from reinforcement learning (RL) is used to train the agent through interaction with the environment; it is combined with the Minimax algorithm from stochastic game theory to solve for the optimal value function in each state, which yields the optimal maneuver decision for the UCAV. Air combat simulation results show that, with this method, the UCAV can choose maneuvers autonomously in different situations and quickly occupy a dominant position, which greatly improves its combat effectiveness.
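The abstract does not give the update rule, so the following is only a minimal sketch of how a DDQN target might be combined with a Minimax value. It assumes a pure-strategy maximin (the full Minimax-Q formulation solves for a mixed strategy with a linear program), and the names `maximin_action`, `minimax_ddqn_target`, the Q-value tables, and the discount factor are all hypothetical illustrations rather than the paper's implementation. Each table is indexed as `q[own_maneuver][opponent_maneuver]` for the next state.

```python
def maximin_action(q_matrix):
    """Pick the own maneuver whose worst-case value is largest.

    q_matrix[a][o] is the estimated value when the UCAV plays maneuver a
    and the opponent plays maneuver o (pure strategies only; an assumption).
    Returns (maneuver index, guaranteed value).
    """
    worst_case = [min(row) for row in q_matrix]  # opponent's best reply per row
    best = max(range(len(worst_case)), key=worst_case.__getitem__)
    return best, worst_case[best]


def minimax_ddqn_target(reward, q_online_next, q_target_next, gamma=0.9, done=False):
    """Double-DQN style target with a maximin value for the next state.

    The online network selects the maneuver and the target network
    evaluates it, which is the decoupling that defines double DQN.
    """
    if done:
        return reward
    a_star, _ = maximin_action(q_online_next)  # selection: online estimates
    next_value = min(q_target_next[a_star])    # evaluation: target estimates
    return reward + gamma * next_value


# Hypothetical next-state estimates: 3 own maneuvers x 2 opponent maneuvers.
q_online_next = [[1.0, 0.2], [0.5, 0.6], [0.9, -0.3]]
q_target_next = [[0.8, 0.1], [0.4, 0.7], [1.0, 0.0]]

selected, _ = maximin_action(q_online_next)
target = minimax_ddqn_target(1.0, q_online_next, q_target_next, gamma=0.5)
print(selected, target)
```

In this toy example the online estimates give worst-case values of 0.2, 0.5, and -0.3, so the second maneuver is selected; the target network then scores that maneuver at its own worst case, 0.4, rather than reusing the online estimate.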
Funder
Air Force Engineering University
Cited by
9 articles.