Hierarchical Active Tracking Control for UAVs via Deep Reinforcement Learning

Published:2021-11-11 Issue:22 Volume:11 Page:10595
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Zhao Wenlong, Meng Zhijun, Wang Kaipeng, Zhang Jiahui, Lu Shaoze

Abstract

Active tracking control is essential for UAVs to perform autonomous operations in GPS-denied environments. In the active tracking task, UAVs take high-dimensional raw images as input and execute motor actions to actively follow the dynamic target. Most research focuses on three-stage methods, which entail perception first, followed by high-level decision-making based on extracted spatial information of the dynamic target, and then UAV movement control using a low-level dynamic controller. Perception methods based on deep neural networks are powerful but require considerable effort for manual ground truth labeling. Instead, we unify the perception and decision-making stages using a high-level controller and then leverage deep reinforcement learning to learn the mapping from raw images to the high-level action commands in the V-REP-based environment, where simulation data are infinite and inexpensive. This end-to-end method also has the advantages of a small parameter size and reduced effort for parameter tuning in the decision-making stage. The high-level controller, which has a novel architecture, explicitly encodes the spatial and temporal features of the dynamic target. Auxiliary segmentation and motion-in-depth losses are introduced to generate denser training signals for the high-level controller’s fast and stable training. The high-level controller and a conventional low-level PID controller constitute our hierarchical active tracking control framework for the UAVs’ active tracking task. Simulation experiments show that our controller, trained with several augmentation techniques, generalizes well to dynamic targets with random appearances and velocities, and achieves significantly better performance than three-stage methods.
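The hierarchical split described in the abstract, a learned high-level controller issuing commands that a conventional low-level PID controller tracks, can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the `high_level_command` stub stands in for the trained network, the gains are arbitrary, and the first-order plant with linear drag is a toy model rather than UAV dynamics.

```python
from dataclasses import dataclass


@dataclass
class PID:
    """Textbook discrete PID controller (gains are illustrative, not the paper's)."""
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: float = 0.0

    def step(self, setpoint: float, measurement: float, dt: float) -> float:
        error = setpoint - measurement
        self.integral += error * dt                      # accumulate the I term
        derivative = (error - self.prev_error) / dt      # finite-difference D term
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def high_level_command(image) -> float:
    """Hypothetical stand-in for the learned high-level controller.

    In the paper this role is played by a neural network that maps raw images
    to high-level action commands; this stub ignores the image and returns a
    fixed forward-velocity command (assumed unit: m/s).
    """
    return 1.0


def simulate(steps: int = 2000, dt: float = 0.01) -> float:
    """Track the commanded velocity on a toy plant: dv/dt = thrust - drag * v."""
    pid = PID(kp=2.0, ki=1.0, kd=0.0)
    velocity = 0.0
    drag = 0.5  # assumed linear drag coefficient
    for _ in range(steps):
        command = high_level_command(image=None)     # high level: what to do
        thrust = pid.step(command, velocity, dt)     # low level: how to do it
        velocity += (thrust - drag * velocity) * dt  # forward-Euler plant update
    return velocity
```

Calling `simulate()` runs 20 simulated seconds and returns the final velocity, which settles close to the commanded value because the integral term removes the steady-state error introduced by drag.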

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/11/22/10595/pdf

Reference: 32 articles.

1. Vision based ground object tracking using AR.Drone quadrotor

2. Vision based GPS-denied Object Tracking and following for unmanned aerial vehicles

3. 3D object following based on visual information for Unmanned Aerial Vehicles

4. Distinctive Image Features from Scale-Invariant Keypoints

Cited by 9 articles.

1. D-VAT: End-to-End Visual Active Tracking for Micro Aerial Vehicles;IEEE Robotics and Automation Letters;2024-06

2. Quadrotor Control System Design for Robust Monocular Visual Tracking;IEEE Transactions on Control Systems Technology;2024

3. Deep reinforcement learning-based air combat maneuver decision-making: literature review, implementation tutorial and future direction;Artificial Intelligence Review;2023-12-28

4. Data Collecting and Monitoring for Photovoltaic System: A Deep-Q-Learning-Based Unmanned Aerial Vehicle-Assisted Scheme;Applied Sciences;2023-10-24

5. Deep Reinforcement Learning Tf-Agent-Based Object Tracking With Virtual Autonomous Drone in a Game Engine;IEEE Access;2023