Two‐order cooperative optimization of swarm control based on reinforcement learning

Author:

Yu Dengxiu¹, Qin Zhenhao¹, Chen Kang¹, Cheong Kang Hao², Chen C. L. Philip³

Affiliation:

1. Unmanned System Research Institute, Northwestern Polytechnical University, Xi'an, China

2. Science, Mathematics and Technology Cluster, Singapore University of Technology and Design, Singapore

3. School of Computer Science and Engineering, South China University of Technology, Guangzhou, China

Abstract

This paper presents a study of the cooperative optimal swarm control problem for two‐order multi‐agent systems with partially unknown nonlinear functions. Unlike traditional approaches that consider a single error, this paper proposes to use multi‐order errors in the performance index function to achieve optimal control performance. Additionally, different proportional coefficients are assigned to illustrate the varying influences of each sequence error, and a two‐order cooperative (TOC) performance index function is designed. To address the influence of unknown nonlinear functions, a swarm control system based on sliding mode control with an actor‐critic network is constructed, which increases the applicability of the proposed method to a variety of dynamic models. Furthermore, to alleviate the computational pressure caused by the multi‐order errors in the TOC performance index function, a new reinforcement learning (RL)‐based sliding mode swarm controller is designed. The stability of the proposed controller is demonstrated using the Lyapunov function. Finally, the control model and control rate are applied to a quadrotor unmanned aerial vehicle system, and simulation results demonstrate that the multi‐agent systems can effectively achieve swarm control.

Impact Statement: This paper proposes a reinforcement learning‐based sliding mode control strategy for the cooperative optimal swarm control problem, where the nonlinear functions of two‐order multi‐agent systems are only partially known. We also propose a cooperative performance index function that takes multi‐order errors into account when optimizing performance. This contribution is significant for research in sliding mode control strategies and error co‐optimization.
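The abstract does not give the TOC performance index in closed form, so the following is only a minimal sketch of the general idea: combine a first-order (position) error and a second-order (velocity) error with separate proportional coefficients, then accumulate a quadratic cost over a trajectory. The names `weighted_error`, `stage_cost`, and `performance_index`, the coefficients `k1`, `k2`, the control weight `r`, and the quadratic form itself are all assumptions made for illustration, not the authors' formulation.

```python
from typing import Iterable, Sequence, Tuple

Vector = Sequence[float]


def weighted_error(pos_err: Vector, vel_err: Vector,
                   k1: float = 1.0, k2: float = 0.5) -> list:
    """Combine two error orders with proportional coefficients (assumed form)."""
    # k1 weights the position error, k2 the velocity error; both are hypothetical.
    return [k1 * p + k2 * v for p, v in zip(pos_err, vel_err)]


def stage_cost(pos_err: Vector, vel_err: Vector, control: Vector,
               k1: float = 1.0, k2: float = 0.5, r: float = 0.1) -> float:
    """Quadratic cost on the combined error plus a control-effort penalty."""
    s = weighted_error(pos_err, vel_err, k1, k2)
    return sum(x * x for x in s) + r * sum(u * u for u in control)


def performance_index(trajectory: Iterable[Tuple[Vector, Vector, Vector]],
                      dt: float, **weights: float) -> float:
    """Rectangle-rule approximation of the integral of the stage cost."""
    return sum(stage_cost(p, v, u, **weights) for p, v, u in trajectory) * dt


if __name__ == "__main__":
    # Two time steps of a single agent with one-dimensional state (made-up values).
    traj = [([1.0], [0.0], [0.0]), ([0.5], [-0.2], [0.3])]
    print(performance_index(traj, dt=0.1))
```

Raising `k2` relative to `k1` makes the cost more sensitive to velocity error, which is one plausible reading of assigning different proportional coefficients to each error order.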

Funder

China Postdoctoral Science Foundation

Publisher

Institution of Engineering and Technology (IET)

Subject

Electrical and Electronic Engineering, Control and Optimization, Computer Science Applications, Human-Computer Interaction, Control and Systems Engineering
