Decentralized Reinforcement Learning Robust Optimal Tracking Control for Time Varying Constrained Reconfigurable Modular Robot Based on ACI andQ-Function-Reference-Cited by-同舟云学术

Decentralized Reinforcement Learning Robust Optimal Tracking Control for Time Varying Constrained Reconfigurable Modular Robot Based on ACI andQ-Function

Published:2013 Issue: Volume:2013 Page:1-16
ISSN:1024-123X
Container-title:Mathematical Problems in Engineering
language:en
Short-container-title:Mathematical Problems in Engineering

Author:

Dong Bo¹^ORCID,Li Yuanchun²^ORCID

Affiliation:

1. Department of Communication Engineering, Jilin University, Changchun 130022, China

2. Department of Control Engineering, Changchun University of Technology, Changchun 130012, China

Abstract

A novel decentralized reinforcement learning robust optimal tracking control theory for time varying constrained reconfigurable modular robots based on action-critic-identifier (ACI) and state-action value function (Q-function) has been presented to solve the problem of the continuous time nonlinear optimal control policy for strongly coupled uncertainty robotic system. The dynamics of time varying constrained reconfigurable modular robot is described as a synthesis of interconnected subsystem, and continuous time state equation andQ-function have been designed in this paper. Combining with ACI and RBF network, the global uncertainty of the subsystem and the HJB (Hamilton-Jacobi-Bellman) equation have been estimated, where critic-NN and action-NN are used to approximate the optimalQ-function and the optimal control policy, and the identifier is adopted to identify the global uncertainty as well as RBF-NN which is used to update the weights of ACI-NN. On this basis, a novel decentralized robust optimal tracking controller of the subsystem is proposed, so that the subsystem can track the desired trajectory and the tracking error can converge to zero in a finite time. The stability of ACI and the robust optimal tracking controller are confirmed by Lyapunov theory. Finally, comparative simulation examples are presented to illustrate the effectiveness of the proposed ACI and decentralized control theory.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

General Engineering,General Mathematics

Link

http://downloads.hindawi.com/journals/mpe/2013/387817.pdf

Reference22 articles.

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Critic Only Policy Iteration-based Zero-sum Neuro-optimal Control of Modular and Reconfigurable Robots with uncertain disturbance via Adaptive Dynamic Programming;2020 12th International Conference on Advanced Computational Intelligence (ICACI);2020-08

2. Model-free optimal decentralized sliding mode control for modular and reconfigurable robots based on adaptive dynamic programming;Advances in Mechanical Engineering;2019-12

3. Decentralized robust optimal control for modular robot manipulators via critic-identifier structure-based adaptive dynamic programming;Neural Computing and Applications;2018-09-21

4. Torque sensorless decentralized neuro-optimal control for modular and reconfigurable robots with uncertain environments;Neurocomputing;2018-03

5. Glycogen Synthase Kinase-3 Modulates Cbl-b and Constrains T Cell Activation;The Journal of Immunology;2017-11-06