Affiliation:
1. School of Aeronautic Science and Engineering, Beihang University, Beijing 100191, China
2. College of Control Science Engineering, Zhejiang University, Hangzhou 310027, China
Abstract
Multiple unmanned aerial vehicle (multi-UAV) cooperative air combat, which is an important form of future air combat, has high requirements for the autonomy and cooperation of unmanned aerial vehicles. Therefore, it is of great significance to study the decision-making method of multi-UAV cooperative air combat since the conventional methods are challenging to solve the high complexity and highly dynamic cooperative air combat problems. This paper proposes a multi-agent double-soft actor-critic (MADSAC) algorithm for solving the cooperative decision-making problem of multi-UAV. The MADSAC achieves multi-UAV cooperative air combat by treating the problem as a fully cooperative game using a decentralized partially observable Markov decision process and a centrally trained distributed execution framework. The use of maximum entropy theory in the update process makes the method more exploratory. Meanwhile, MADSAC uses double-centralized critics, target networks, and delayed policy updates to solve the overestimation and error accumulation problems effectively. In addition, the double-centralized critics based on the attention mechanism improve the scalability and learning efficiency of MADSAC. Finally, multi-UAV cooperative air combat experiments validate the effectiveness of MADSAC.
Funder
National Natural Science Foundation of China
Aeronautical Science Foundation of China
Reference37 articles.
1. Wireless Communications with Unmanned Aerial Vehicles: Opportunities and Challenges;Zeng;IEEE Commun. Mag.,2016
2. Tsach, S., Peled, A., Penn, D., Keshales, B., and Guedj, R. (2007, January 7–10). Development Trends for Next Generation of UAV Systems. Proceedings of the AIAA Infotech@Aerospace 2007 Conference and Exhibit, Rohnert Park, CA, USA.
3. Differential game based air combat maneuver generation using scoring function matrix;Park;Int. J. Aeronaut. Space,2016
4. UAV air combat decision based on evolutionary expert system tree;Wang;Ordnance Ind. Autom.,2019
5. Autonomous air combat maneuver decision using Bayesian inference and moving horizon optimization;Huang;J. Syst. Eng. Electron.,2018
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献