Online Inverse Optimal Control for Time-Varying Cost Weights-Reference-Cited by-同舟云学术

Online Inverse Optimal Control for Time-Varying Cost Weights

Published:2024-01-31 Issue:2 Volume:9 Page:84
ISSN:2313-7673
Container-title:Biomimetics
language:en
Short-container-title:Biomimetics

Author:

Cao Sheng¹^ORCID,Luo Zhiwei¹,Quan Changqin¹

Affiliation:

1. Graduate School of System Informatics, Kobe University, 1-1 Rokkodai-cho, Nada-ku, Kobe 657-8501, Japan

Abstract

Inverse optimal control is a method for recovering the cost function used in an optimal control problem in expert demonstrations. Most studies on inverse optimal control have focused on building the unknown cost function through the linear combination of given features with unknown cost weights, which are generally considered to be constant. However, in many real-world applications, the cost weights may vary over time. In this study, we propose an adaptive online inverse optimal control approach based on a neural-network approximation to address the challenge of recovering time-varying cost weights. We conduct a well-posedness analysis of the problem and suggest a condition for the adaptive goal, under which the weights of the neural network generated to achieve this adaptive goal are unique to the corresponding inverse optimal control problem. Furthermore, we propose an updating law for the weights of the neural network to ensure the stability of the convergence of the solutions. Finally, simulation results for an example linear system are presented to demonstrate the effectiveness of the proposed strategy. The proposed method is applicable to a wide range of problems requiring real-time inverse optimal control calculations.

Publisher

MDPI AG

Link

https://www.mdpi.com/2313-7673/9/2/84/pdf

Reference29 articles.

1. Control of Mammalian Locomotion by Somatosensory Feedback;Frigon;Compr. Physiol.,2021

2. A framework of human–robot coordination based on game theory and policy iteration;Li;IEEE Trans. Robot.,2016

3. Ziebart, B.D., Maas, A.L., Bagnell, J.A., and Dey, A.K. (2009, January 23–25). Human Behavior Modeling with Maximum Entropy Inverse Optimal Control. Proceedings of the AAAI Spring Symposium: Human Behavior Modeling, Stanford, CA, USA.

4. Berret, B., Chiovetto, E., Nori, F., and Pozzo, T. (2011). Evidence for composite cost functions in arm movement planning: An inverse optimal control approach. PLoS Comput. Biol., 7.

5. Adaptive learning of human motor behaviors: An evolving inverse optimal control approach;Abouelsoud;Eng. Appl. Artif. Intell.,2016