Enhancing Quadrotor Control Robustness with Multi-Proportional–Integral–Derivative Self-Attention-Guided Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Enhancing Quadrotor Control Robustness with Multi-Proportional–Integral–Derivative Self-Attention-Guided Deep Reinforcement Learning

Published:2024-07-10 Issue:7 Volume:8 Page:315
ISSN:2504-446X
Container-title:Drones
language:en
Short-container-title:Drones

Author:

Ren Yahui¹,Zhu Feng¹,Sui Shuaishuai¹,Yi Zhengming²,Chen Kai¹

Affiliation:

1. College of Systems Engineering, National University of Defense Technology, Changsha 410073, China

2. College of Information and Intelligence, Hunan Agricultural University, Changsha 410073, China

Abstract

Deep reinforcement learning has demonstrated flexibility advantages in the control field of quadrotor aircraft. However, when there are sudden disturbances in the environment, especially special disturbances beyond experience, the algorithm often finds it difficult to maintain good control performance. Additionally, due to the randomness in the algorithm’s exploration of states, the model’s improvement efficiency during the training process is low and unstable. To address these issues, we propose a deep reinforcement learning framework guided by Multi-PID Self-Attention to tackle the challenges in the training speed and environmental adaptability of quadrotor aircraft control algorithms. In constructing the simulation experiment environment, we introduce multiple disturbance models to simulate complex situations in the real world. By combining the PID control strategy with deep reinforcement learning and utilizing the multi-head self-attention mechanism to optimize the state reward function in the simulation environment, this framework achieves an efficient and stable training process. This experiment aims to train a quadrotor simulation model to accurately fly to a predetermined position under various disturbance conditions and subsequently maintain a stable hovering state. The experimental results show that, compared with traditional deep reinforcement learning algorithms, this method achieves significant improvements in training efficiency and state exploration ability. At the same time, this study deeply analyzes the application effect of the algorithm in different complex environments, verifies its superior robustness and generalization ability in dealing with environmental disturbances, and provides a new solution for the intelligent control of quadrotor aircraft.

Funder

Hunan Provincial Department of Education Scientific Research Outstanding Youth Project

Publisher

MDPI AG

Link

https://www.mdpi.com/2504-446X/8/7/315/pdf

Reference30 articles.

1. The role of information and communication technologies (ICTs) in household energy consumption-prospects for the UK;Martiskainen;Energy Effic.,2011

2. Mohsan, S.A.H., Khan, M.A., Noor, F., Ullah, I., and Alsharif, M.H. (2022). Towards the unmanned aerial vehicles (UAVs): A comprehensive review. Drones, 6.

3. Liu, R., Nageotte, F., Zanne, P., de Mathelin, M., and Dresp-Langley, B. (2021). Deep reinforcement learning for the control of robotic manipulation: A focussed mini-review. Robotics, 10.

4. Machine Learning, Deep Learning and Statistical Analysis for forecasting building energy consumption—A systematic review;Khalil;Eng. Appl. Artif. Intell.,2022

5. Willis, M.J. (1999). Proportional-Integral-Derivative Control, Department of Chemical and Process Engineering, University of Newcastle.