Affiliation:
1. College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
2. College of Information and Intelligence, Hunan Agricultural University, Changsha 410073, China
Abstract
Deep reinforcement learning offers flexible control of quadrotor aircraft. However, under sudden environmental disturbances, especially disturbances outside the training experience, such algorithms often struggle to maintain good control performance. In addition, because state exploration is random, the model improves slowly and unstably during training. To address these issues, we propose a deep reinforcement learning framework guided by Multi-PID Self-Attention that improves the training speed and environmental adaptability of quadrotor control algorithms. In constructing the simulation environment, we introduce multiple disturbance models to emulate complex real-world conditions. By combining the PID control strategy with deep reinforcement learning and using a multi-head self-attention mechanism to optimize the state reward function in the simulation environment, the framework achieves an efficient and stable training process. The experiments train a simulated quadrotor to fly accurately to a predetermined position under various disturbance conditions and then maintain a stable hover. The experimental results show that, compared with traditional deep reinforcement learning algorithms, the method achieves significant improvements in training efficiency and state exploration ability. The study also analyzes the algorithm's performance in different complex environments, verifies its superior robustness and generalization ability under environmental disturbances, and provides a new solution for the intelligent control of quadrotor aircraft.
Funder
Hunan Provincial Department of Education Scientific Research Outstanding Youth Project