Autonomous maneuver strategy of swarm air combat based on DDPG

Author:

Wang LuheORCID,Hu Jinwen,Xu Zhao,Zhao Chunhui

Abstract

AbstractUnmanned aerial vehicles (UAVs) have been found significantly important in the air combats, where intelligent and swarms of UAVs will be able to tackle with the tasks of high complexity and dynamics. The key to empower the UAVs with such capability is the autonomous maneuver decision making. In this paper, an autonomous maneuver strategy of UAV swarms in beyond visual range air combat based on reinforcement learning is proposed. First, based on the process of air combat and the constraints of the swarm, the motion model of UAV and the multi-to-one air combat model are established. Second, a two-stage maneuver strategy based on air combat principles is designed which include inter-vehicle collaboration and target-vehicle confrontation. Then, a swarm air combat algorithm based on deep deterministic policy gradient strategy (DDPG) is proposed for online strategy training. Finally, the effectiveness of the proposed algorithm is validated by multi-scene simulations. The results show that the algorithm is suitable for UAV swarms of different scales.

Funder

foundation of cetc key laboratory of data link technology

national natural science foundation of china

the key research and development project of shaanxi province

the aeronautical science foundation of china

the china postdoctoral science foundation

Publisher

Springer Science and Business Media LLC

Reference39 articles.

1. Y. Li, X. Qiu, X. Liu, Q. Xia, Deep reinforcement learning and its application in autonomous fitting optimization for attack areas of ucavs. J. Syst. Eng. Electron.31(4), 734–742 (2020).

2. D. Hu, R. Yang, J. Zuo, Z. Zhang, Y. Wang, Application of deep reinforcement learning in maneuver planning of beyond-visual-range air combat. IEEE Access. PP(99), 1–1 (2021).

3. A. Xu, X. Chen, Z. W. Li, X. D. Hu, A method of situation assessment for beyond-visual-range air combat based on tactical attack area. Fire Control Command Control. 45(9), 97–102 (2020).

4. Z. H. Hu, Y. Lv, A. Xu, A threat assessment method for beyond-visual-range air combat based on situation prediction. Electron. Opt. Control. 27(3), 8–1226 (2020).

5. W. H. Wu, S. Y. Zhou, L. Gao, J. T. Liu, Improvements of situation assessment for beyond-visual-range air combat based on missile launching envelope analysis. Syst. Eng. Electron.33(12), 2679–2685 (2011).

Cited by 13 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Mean policy-based proximal policy optimization for maneuvering decision in multi-UAV air combat;Neural Computing and Applications;2024-08-07

2. Hierarchical Multi-Agent Reinforcement Learning for Air Combat Maneuvering;2023 International Conference on Machine Learning and Applications (ICMLA);2023-12-15

3. Research on Maneuvering Control Algorithm of Short-Range UAV Air Combat Based on Deep Reinforcement Learning;2023 2nd International Conference on Machine Learning, Cloud Computing and Intelligent Mining (MLCCIM);2023-07-25

4. A heuristic maintenance scheduling framework for a military aircraft fleet under limited maintenance capacities;Reliability Engineering & System Safety;2023-07

5. Air Combat Maneuver Decision Method Based on A3C Deep Reinforcement Learning;Machines;2022-11-05

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3