Reinforcement Learning for UAV Attitude Control-Reference-Cited by-同舟云学术

Reinforcement Learning for UAV Attitude Control

Published:2019-04-30 Issue:2 Volume:3 Page:1-21
ISSN:2378-962X
Container-title:ACM Transactions on Cyber-Physical Systems
language:en
Short-container-title:ACM Trans. Cyber-Phys. Syst.

Author:

Koch William¹^ORCID,Mancuso Renato¹,West Richard¹,Bestavros Azer¹

Affiliation:

1. Boston University, Boston, MA, USA

Abstract

Autopilot systems are typically composed of an “inner loop” providing stability and control, whereas an “outer loop” is responsible for mission-level objectives, such as way-point navigation. Autopilot systems for unmanned aerial vehicles are predominately implemented using Proportional-Integral-Derivative (PID) control systems, which have demonstrated exceptional performance in stable environments. However, more sophisticated control is required to operate in unpredictable and harsh environments. Intelligent flight control systems is an active area of research addressing limitations of PID control most recently through the use of reinforcement learning (RL), which has had success in other applications, such as robotics. Yet previous work has focused primarily on using RL at the mission-level controller. In this work, we investigate the performance and accuracy of the inner control loop providing attitude control when using intelligent flight control systems trained with state-of-the-art RL algorithms—Deep Deterministic Policy Gradient, Trust Region Policy Optimization, and Proximal Policy Optimization. To investigate these unknowns, we first developed an open source high-fidelity simulation environment to train a flight controller attitude control of a quadrotor through RL. We then used our environment to compare their performance to that of a PID controller to identify if using RL is appropriate in high-precision, time-critical flight control.

Funder

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Control and Optimization,Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3301273

Reference41 articles.

Cited by 276 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Simulation-based evaluation of model-free reinforcement learning algorithms for quadcopter attitude control and trajectory tracking;Neurocomputing;2024-12

2. A survey on reinforcement learning in aviation applications;Engineering Applications of Artificial Intelligence;2024-10

3. Reinforcement learning-based drone simulators: survey, practice, and challenge;Artificial Intelligence Review;2024-09-05

4. Distributed consensus and formation control of multi-AUV systems under actuator faults and switching topology;European Journal of Control;2024-09

5. Deep Reinforcement Learning for sim-to-real policy transfer of VTOL-UAVs offshore docking operations;Applied Soft Computing;2024-09