Error Dynamics Based Dual Heuristic Dynamic Programming for Self-Learning Flight Control-Reference-Cited by-同舟云学术

Error Dynamics Based Dual Heuristic Dynamic Programming for Self-Learning Flight Control

Published:2022-12-31 Issue:1 Volume:13 Page:586
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Huang Xu^ORCID,Zhang Yuan^ORCID,Liu Jiarun,Zhong Honghao,Wang Zhaolei,Peng Yue

Abstract

A data-driven nonlinear control approach, called error dynamics-based dual heuristic dynamic programming (ED-DHP), is proposed for air vehicle attitude control. To solve the optimal tracking control problem, the augmented system is defined by the derived error dynamics and reference trajectory so that the actor neural network can learn the feedforward and feedback control terms at the same time. During the online self-learning process, the actor neural network learns the control policy by minimizing the augmented system’s value function. The input dynamics identified by the recursive least square (RLS) and output of the critic neural network are used to update the actor neural network. In addition, the total uncertainty term of the error dynamics is also identified by RLS, which can compensate for the uncertainty caused by inaccurate modeling, parameter perturbation, and so on. The outputs of ED-DHP include the rough trim surface, feedforward and feedback terms from the actor neural network, and the compensation. Based on this control scheme, the complete knowledge of system dynamics and the reference trajectory dynamics are not needed, and offline learning is unnecessary. To verify the self-learning ability of ED-DHP, two numerical experiments are carried out based on the established morphing air vehicle model. One is sinusoidal signal tracking at a fixed operating point, and the other is guidance command tracking with a morphing process at variable operating points. The simulation results demonstrate the good performance of ED-DHP for online self-learning attitude control and validate the robustness of the proposed scheme

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/1/586/pdf

Reference29 articles.

1. Review of advanced guidance and control algorithms for space/aerospace vehicles;Chai;Prog. Aerosp. Sci.,2021

2. Review of control and guidance technology on hypersonic vehicle;Ding;Chin. J. Aeronaut.,2022

3. Reinforcement Learning in Robotics: A Survey;Kober;Learning Motor Skills. Springer Tracts in Advanced Robotics,2014

4. Magnetic control of tokamak plasmas through deep reinforcement learning;Degrave;Nature,2022

5. Tang, X., Qin, Z.T., and Zhang, F. (2019, January 4–8). A Deep Value-network Based Approach for Multi-Driver Order Dispatching. Proceedings of the 25th ACM SIGKDD International Conference, Anchorage, AK, USA.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Non-Global Traversal Method for Dynamic Load Rapid Localization and Identification;Aerospace;2024-02-26

2. Online self-learning attitude tracking control of morphing unmanned aerial vehicle based on dual heuristic dynamic programming;Aerospace Science and Technology;2023-12

3. Special Issue on Advanced Technology of Intelligent Control and Simulation Evaluation;Applied Sciences;2023-09-28