Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV With Thrust Vectoring Rotors-Reference-Cited by-同舟云学术

Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV With Thrust Vectoring Rotors

Published:2020-10-05 Issue: Volume: Page:
ISSN:
Container-title:Volume 2: Intelligent Transportation/Vehicles; Manufacturing; Mechatronics; Engine/After-Treatment Systems; Soft Actuators/Manipulators; Modeling/Validation; Motion/Vibration Control Applications; Multi-Agent/Networked Systems; Path Planning/Motion Control; Renewable/Smart Energy Systems; Security/Privacy of Cyber-Physical Systems; Sensors/Actuators; Tracking Control Systems; Unmanned Ground/Aerial Vehicles; Vehicle Dynamics, Estimation, Control; Vibration/Control Systems; Vibrations
language:
Short-container-title:

Author:

Deshpande Aditya M.¹,Kumar Rumit¹,Minai Ali A.¹,Kumar Manish¹

Affiliation:

1. University of Cincinnati

Abstract

Abstract In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadcopter (comparatively simple UAV design without thrust vectoring). This approach allows learning a control policy for systems with multiple inputs and multiple outputs. The performance of the learned policy is evaluated by physics-based simulations for the tasks of hovering and way-point navigation. The flight simulations utilize a flight controller based on reinforcement learning without any additional PID components. The results show faster learning with the presented approach as opposed to learning the control policy from scratch for this new UAV design created by modifications in a conventional quadcopter, i.e., the addition of more degrees of freedom (4-actuators in conventional quadcopter to 8-actuators in tilt-rotor quadcopter). We demonstrate the robustness of our learned policy by showing the recovery of the tilt-rotor platform in the simulation from various non-static initial conditions in order to reach a desired state. The developmental policy for the tilt-rotor UAV also showed superior fault tolerance when compared with the policy learned from the scratch. The results show the ability of the presented approach to bootstrap the learned behavior from a simpler system (lower-dimensional action-space) to a more complex robot (comparatively higher-dimensional action-space) and reach better performance faster.

Publisher

American Society of Mechanical Engineers

Link

http://asmedigitalcollection.asme.org/DSCC/proceedings-pdf/doi/10.1115/DSCC2020-3319/6622427/v002t36a011-dscc2020-3319.pdf

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning;2023 Latin American Robotics Symposium (LARS), 2023 Brazilian Symposium on Robotics (SBR), and 2023 Workshop on Robotics in Education (WRE);2023-10-09

2. Physics-Based Neural Networks for Modeling & Control of Aerial Vehicles;2022 American Control Conference (ACC);2022-06-08

3. Bio-Inspired Feedback Linearized Adaptive Control For a Thrust Vectoring Free-Flyer Vehicle;Journal of Intelligent & Robotic Systems;2021-05-24

4. Robust Deep Reinforcement Learning for Quadcopter Control;IFAC-PapersOnLine;2021

5. Flight Control of a Multicopter using Reinforcement Learning;IFAC-PapersOnLine;2021