UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning-Reference-Cited by-同舟云学术

UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

Published:2020-11-18 Issue:22 Volume:12 Page:3789
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Li Bo^ORCID,Gan Zhigang,Chen Daqing^ORCID,Sergey Aleksandrovich Dyachenko

Abstract

This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), to realize the control of unmanned aerial vehicle (UAV), allowing a UAV to quickly track a target in an environment where the motion of a target is uncertain. This approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We consider a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine meta-learning to develop a multi-task reinforcement learning update method to ensure the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and twin delayed deep deterministic policy gradient (TD3), experimental results show that the Meta-TD3 algorithm has achieved a great improvement in terms of both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 only requires a few steps to train to enable a UAV to adapt quickly to a new target movement mode more and maintain a better tracking effectiveness.

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/12/22/3789/pdf

Reference36 articles.

1. Towards an Autonomous Vision-Based Unmanned Aerial System against Wildlife Poachers

2. Safety, Security, and Rescue Missions with an Unmanned Aerial Vehicle (UAV)

3. Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning

Cited by 55 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. P-DRL: A Framework for Multi-UAVs Dynamic Formation Control under Operational Uncertainty and Unknown Environment;Drones;2024-09-10

2. A Deep Reinforcement Learning-Based Intelligent Maneuvering Strategy for the High-Speed UAV Pursuit-Evasion Game;Drones;2024-07-09

3. MERA: Meta-Learning Based Runtime Adaptation for Industrial Wireless Sensor-Actuator Networks;ACM Transactions on Sensor Networks;2024-07-08

4. Multi-UAV roundup strategy method based on deep reinforcement learning CEL-MADDPG algorithm;Expert Systems with Applications;2024-07

5. Meta-Reinforcement Learning Based Cooperative Surface Inspection of 3D Uncertain Structures using Multi-robot Systems;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13