Affiliation:
1. Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing, China
2. College of Electronic and Information Engineering, Southwest University, Chongqing, China
Abstract
This paper presents a novel adaptive dynamic programming (ADP) method for solving the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Unlike classical RL-based optimal control algorithms built on the one-step temporal difference method, a multi-step (also called n-step) policy gradient ADP (MS-PGADP) algorithm, which has been shown to be more efficient owing to its faster propagation of rewards, is proposed to obtain the iterative control policies. Moreover, a novel Q-function is defined that estimates the performance of taking an action in the current state. Then, via the Lyapunov stability theorem and functional analysis, the optimality of the performance index function is proved and the stability of the error system is established. Furthermore, actor-critic neural networks are used to implement the proposed method. Inspired by the deep Q-network, a target network is also introduced to guarantee the stability of the neural networks during training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
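The abstract contrasts one-step temporal-difference targets with multi-step (n-step) targets. The following is a minimal, generic sketch of an n-step TD target (not the paper's implementation; the function name and parameters are our own for illustration) showing why rewards propagate back faster when n > 1:

```python
def n_step_td_target(rewards, bootstrap_value, gamma):
    """Compute sum_{k=0}^{n-1} gamma^k * r_k + gamma^n * bootstrap_value.

    rewards: list of the n observed rewards r_0, ..., r_{n-1}
    bootstrap_value: value (or Q-value) estimate at the state n steps ahead
    gamma: discount factor in (0, 1]
    """
    target = bootstrap_value
    # Fold rewards in backwards so each pass applies one discount factor.
    for r in reversed(rewards):
        target = r + gamma * target
    return target

# One-step TD is the special case n = 1:
one_step = n_step_td_target([1.0], 0.5, 0.9)              # 1.0 + 0.9*0.5 = 1.45
# With n = 3, three rewards enter a single update:
three_step = n_step_td_target([1.0, 1.0, 1.0], 0.5, 0.9)  # 3.0745
```

With n = 1 each update moves information back only one state, whereas the n-step target lets a single update incorporate n rewards at once, which is the faster reward propagation the abstract attributes to MS-PGADP.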
Funder
National Natural Science Foundation of China
Publisher
Institution of Engineering and Technology (IET)
Subject
Electrical and Electronic Engineering, Control and Optimization, Computer Science Applications, Human-Computer Interaction, Control and Systems Engineering
Cited by
3 articles.