Countering a Drone in a 3D Space: Analyzing Deep Reinforcement Learning Methods
Author:
Çetin EnderORCID, Barrado CristinaORCID, Pastor EnricORCID
Abstract
Unmanned aerial vehicles (UAV), also known as drones have been used for a variety of reasons and the commercial drone market growth is expected to reach remarkable levels in the near future. However, some drone users can mistakenly or intentionally fly into flight paths at major airports, flying too close to commercial aircraft or invading people’s privacy. In order to prevent these unwanted events, counter-drone technology is needed to eliminate threats from drones and hopefully they can be integrated into the skies safely. There are various counter-drone methods available in the industry. However, a counter-drone system supported by an artificial intelligence (AI) method can be an efficient way to fight against drones instead of human intervention. In this paper, a deep reinforcement learning (DRL) method has been proposed to counter a drone in a 3D space by using another drone. In a 2D space it is already shown that the deep reinforcement learning method is an effective way to counter a drone. However, countering a drone in a 3D space with another drone is a very challenging task considering the time required to train and avoid obstacles at the same time. A Deep Q-Network (DQN) algorithm with dueling network architecture and prioritized experience replay is presented to catch another drone in the environment provided by an Airsim simulator. The models have been trained and tested with different scenarios to analyze the learning progress of the drone. Experiences from previous training are also transferred before starting a new training by pre-processing the previous experiences and eliminating those considered as bad experiences. The results show that the best models are obtained with transfer learning and the drone learning progress has been increased dramatically. Additionally, an algorithm which combines imitation learning and reinforcement learning is implemented to catch the target drone. In this algorithm, called deep q-learning from demonstrations (DQfD), expert demonstrations data and self-generated data by the agent are sampled and the agent continues learning without overwriting the demonstration data. The main advantage of this algorithm is to accelerate the learning process even if there is a small amount of demonstration data.
Funder
AGAUR Ministry of Science and Innovation of Spain SESAR Joint Undertaking (JU) project CORUS-XUAM European Union’s Horizon 2020 research and innovation program
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference35 articles.
1. (2019, August 21). Anti-Drone Market “To Be Worth USD1.5 Billion” by 2023—New Report. Available online: https://www.unmannedairspace.info/utm-and-c-uas-market-analysis/anti-drone-market-to-be-worth-usd1-5-billion-by-2023-new-report/. 2. (2022, August 04). Drones by the Numbers, Available online: https://www.faa.gov/uas/resources/by_the_numbers/. 3. Chiper, F.L., Martian, A., Vladeanu, C., Marghescu, I., Craciunescu, R., and Fratu, O. (2022). Drone Detection and Defense Systems: Survey and a Software-Defined Radio-Based Solution. Sensors, 22. 4. Drozdowicz, J., Wielgo, M., Samczynski, P., Kulpa, K., Krzonkalla, J., Mordzonek, M., Bryl, M., and Jakielaszek, Z. (2016, January 10–12). 35 GHz FMCW drone detection system. Proceedings of the 2016 17th International Radar Symposium (IRS), Krakow, Poland. 5. Semkin, V., Yin, M., Hu, Y., Mezzavilla, M., and Rangan, S. (2021, January 25–28). Drone detection and classification based on radar cross section signatures. Proceedings of the 2020 International Symposium on Antennas and Propagation (ISAP), Osaka, Japan.
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|