Affiliation:
1. College of Mechanical & Energy Engineering, Beijing University of Technology, Beijing 100124, China
Abstract
This study proposes a novel adaptive distributed recurrent SAC (Soft Actor–Critic) control method to address the leader–follower formation control problem of omnidirectional mobile robots. Our method eliminates the reliance on the complete state of the leader and achieves formation control using only the relative pose between robots. Moreover, we develop a novel recurrent SAC reinforcement learning framework that ensures the controller exhibits good transient and steady-state characteristics, yielding outstanding control performance. We also present an episode-based memory replay buffer and sampling approach, along with a normalized reward function, which enable the recurrent SAC formation framework to converge rapidly and to receive consistent rewards across various leader–follower tasks. This facilitates better learning of, and adaptation to, the formation requirements in different scenarios. Furthermore, to bolster the generalization capability of our method, we normalize the state space, effectively eliminating differences between formation tasks of different shapes. Leader–follower formation experiments with different formation shapes in the Gazebo simulator achieve excellent results, validating the efficacy of our method. Comparative experiments with traditional PID and common network controllers demonstrate that our method achieves faster convergence and greater robustness. These simulation results provide strong support for our study and demonstrate the potential and reliability of our method for solving real-world problems.
Funder
National Key Research and Development Program of China
Research Funds for Leading Talents Program