Affiliation:
1. Department of Automation Zhejiang University of Technology Hangzhou China
Abstract
SummaryA partial model‐free, data‐driven adaptive optimal output feedback (OPFB) control scheme with low computational cost continuous‐time is proposed in this paper. The design objective is to obtain the optimal control law by using measurable input and output data, without some knowledge of system model information. Firstly, the system states are decoupled into measurable and unmeasurable parts, and a new state‐space equation is built to estimate the unmeasurable states by using a reduced‐order observer. Based on this, a parametrization method is utilized to reconstruct the system states. Subsequently, by using the reconstructed states, the adaptive dynamic programming (ADP) Bellman equations based on policy‐iteration (PI) and value‐iteration (VI) are presented to solve the control problems with initially stable and unstable conditions, respectively. Then, the convergence of the system is proved. Compared with the early proposed OPFB algorithms, only the unknown internal state needs to be reconstructed. Therefore, the computation cost and design complexity are reduced for the proposed scheme. The effectiveness of the proposed scheme is verified through two numerical simulations. In addition, a practical inverted pendulum experiment is carried out to demonstrate the performance of the proposed scheme.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Zhejiang Province