Affiliation:
1. School of Aeronautic Science and Engineering, Beihang University, Beijing 100191, China
Abstract
Applying deep reinforcement learning to three-dimensional Unmanned Aerial Vehicle (UAV) air game maneuver decision-making often suffers from low training-data utilization and difficulty converging. To address these issues, this study proposes an expert experience storage mechanism that improves the algorithm's performance with less experience replay time. Based on this mechanism, a maneuver decision algorithm using the Dueling Double Deep Q Network is introduced. Simulation experiments show that, compared with the prioritized experience replay mechanism, the proposed mechanism reduces the experience replay time by 81.3% while enabling the UAV agent to achieve a higher maximum average reward. Additionally, the proposed maneuver decision algorithm identifies the optimal policy for attacking target UAVs that use different fixed strategies.
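For readers unfamiliar with the Dueling Double Deep Q Network named in the abstract, the following is a minimal sketch of its two textbook ingredients: the dueling aggregation of a state value and per-action advantages, and the Double DQN target in which the online network selects the next action and the target network evaluates it. It is not the authors' implementation. The function names, the list-based inputs, and the default discount factor are assumptions made for illustration, and the expert experience storage mechanism is not modeled because the abstract does not specify it.

```python
def dueling_q(value, advantages):
    """Combine V(s) and A(s, a) into Q(s, a) (standard dueling aggregation).

    value: float, estimated state value V(s).
    advantages: list of floats, one advantage A(s, a) per action.
    The mean advantage is subtracted so that V and A are identifiable.
    """
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def double_dqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Compute the Double DQN bootstrap target for one transition.

    q_online_next: Q-values of the next state from the online network.
    q_target_next: Q-values of the next state from the target network.
    gamma: discount factor (0.99 is an illustrative default, not from the paper).
    """
    if done:
        # Terminal transition: no bootstrapping from the next state.
        return reward
    # The online network picks the action ...
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    # ... and the target network supplies its value.
    return reward + gamma * q_target_next[best_action]
```

Decoupling action selection from action evaluation in this way is what distinguishes Double DQN from the original DQN target, which takes the maximum over the target network's own estimates.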
Funder
National Natural Science Foundation (NSF) of China
Fundamental Research Funds for the Central Universities
Subject
Artificial Intelligence, Computer Science Applications, Aerospace Engineering, Information Systems, Control and Systems Engineering