Application of Deep Reinforcement Learning to Defense and Intrusion Strategies Using Unmanned Aerial Vehicles in a Versus Game

Published:2024-07-31 Issue:8 Volume:8 Page:365
ISSN:2504-446X
Container-title:Drones
language:en
Short-container-title:Drones

Author:

Chen Chieh-Li¹, Huang Yu-Wen¹, Shen Ting-Ju¹

Affiliation:

1. Department of Aeronautics and Astronautics, National Cheng Kung University, Tainan 701, Taiwan

Abstract

Drones operate in complex scenes across a range of scenarios. In a versus game, drones need efficient and effective algorithms to track targets of interest and protect allied targets. This study used physical quadcopter models and a scene engine to investigate the performance of attacker and defender drones trained with deep reinforcement learning. The soft actor-critic deep reinforcement learning network was applied together with the proposed reward and penalty functions designed for the scenario. AirSim UAV physical modeling and mission scenarios built on Unreal Engine were used to train the attacking and defending skills of both drones simultaneously, so that the required combat strategies and flight skills could improve over a series of competition episodes. After 500 training episodes, both drones could accelerate, detour, and evade, reaching reasonably good performance with a roughly even outcome. Validation scenarios demonstrated that the attacker–defender winning ratio improved from 1:2 to 1.2:1, which is reasonable for drones with equal flight capabilities. Although this suggests that the attacker may have an advantage in scenarios it has not experienced, it indicates that the strategies generated by deep reinforcement learning networks are robust and feasible.
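As a reading aid for the winning ratios quoted in the abstract, the short sketch below converts an attacker:defender ratio into win shares. It assumes that every game ends with a win for exactly one side (no draws), which the abstract does not state; the function name `win_share` is purely illustrative and does not come from the paper.

```python
def win_share(attacker: float, defender: float) -> tuple[float, float]:
    """Convert an attacker:defender winning ratio into shares that sum to 1.

    Assumption (not from the paper): every game is won by one side, no draws.
    """
    total = attacker + defender
    if total <= 0:
        raise ValueError("ratio terms must sum to a positive value")
    return attacker / total, defender / total


# Ratios quoted in the abstract: 1:2 and 1.2:1 (attacker:defender).
before = win_share(1.0, 2.0)
after = win_share(1.2, 1.0)

print(f"1:2   -> attacker {before[0]:.1%}, defender {before[1]:.1%}")
print(f"1.2:1 -> attacker {after[0]:.1%}, defender {after[1]:.1%}")
```

Under that assumption, a 1:2 ratio corresponds to the attacker winning about one third of games, and 1.2:1 to the attacker winning slightly more than half.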

Funder

The Armaments Bureau, MND

Publisher

MDPI AG

Link

https://www.mdpi.com/2504-446X/8/8/365/pdf

Reference: 22 articles.

1. Watkins (1992). Q-learning. Mach. Learn.

2. Hosu, I.A., and Rebedea, T. (2016). Playing Atari games with deep reinforcement learning and human checkpoint replay. arXiv.

3. Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., and Riedmiller, M. (2013). Playing Atari with deep reinforcement learning. arXiv.

4. Lillicrap, T.P., Hunt, J.J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Silver, D., and Wierstra, D. (2015). Continuous control with deep reinforcement learning. arXiv.

5. Gu, S., Lillicrap, T., Sutskever, I., and Levine, S. (2016, January 20–22). Continuous deep Q-learning with model-based acceleration. Proceedings of the International Conference on Machine Learning, PMLR, New York, NY, USA.