Sequencing of multi-robot behaviors using reinforcement learning

Published:2021-11 Issue:4 Volume:19 Page:529-537
ISSN:2095-6983
Container-title:Control Theory and Technology
language:en
Short-container-title:Control Theory Technol.

Author:

Pierpaoli Pietro, Doan Thinh T., Romberg Justin, Egerstedt Magnus

Abstract

Given a collection of parameterized multi-robot controllers associated with individual behaviors designed for particular tasks, this paper considers the problem of how to sequence and instantiate the behaviors for the purpose of completing a more complex, overarching mission. In addition, uncertainties about the environment or even the mission specifications may require the robots to learn, in a cooperative manner, how best to sequence the behaviors. In this paper, we approach this problem by using reinforcement learning to approximate the solution to the computationally intractable sequencing problem, combined with an online gradient descent approach to selecting the individual behavior parameters, while the transitions among behaviors are triggered automatically when the behaviors have reached a desired performance level relative to a task performance cost. To illustrate the effectiveness of the proposed method, it is implemented on a team of differential-drive robots for solving two different missions, namely, convoy protection and object manipulation.
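The three ingredients named in the abstract (learned sequencing, gradient-based parameter selection, and cost-triggered transitions) can be illustrated with a toy sketch. This is not the authors' algorithm: it swaps in plain tabular Q-learning, and the quadratic costs in `TARGETS`, the mission order in `REQUIRED_ORDER`, the reward shaping, and all function names are hypothetical choices made only for illustration.

```python
import random

# Hypothetical toy setup: behavior b has the quadratic task cost
# J_b(theta) = (theta - TARGETS[b])**2 over a single scalar parameter theta.
TARGETS = [1.0, -2.0, 0.5]
# Hypothetical mission: progress advances only when behaviors run in this order.
REQUIRED_ORDER = [2, 0, 1]


def run_behavior(b, theta0=0.0, step=0.2, tol=1e-3, max_iters=200):
    """Gradient descent on the behavior parameter.

    The behavior ends (the transition is triggered) as soon as its task
    cost drops to `tol` or below. Returns the parameter and iteration count.
    """
    theta = theta0
    for it in range(max_iters):
        cost = (theta - TARGETS[b]) ** 2
        if cost <= tol:
            return theta, it
        grad = 2.0 * (theta - TARGETS[b])  # dJ_b/dtheta
        theta -= step * grad
    return theta, max_iters


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning over (mission stage, behavior) pairs."""
    rng = random.Random(seed)
    n_stages = len(REQUIRED_ORDER)
    n_behaviors = len(TARGETS)
    # One extra row for the terminal stage; its values stay at zero.
    q = [[0.0] * n_behaviors for _ in range(n_stages + 1)]
    for _ in range(episodes):
        stage = 0
        for _ in range(20):  # cap on behaviors executed per episode
            if stage == n_stages:
                break
            if rng.random() < epsilon:  # epsilon-greedy exploration
                b = rng.randrange(n_behaviors)
            else:
                b = max(range(n_behaviors), key=lambda a: q[stage][a])
            _, iters = run_behavior(b)
            advanced = b == REQUIRED_ORDER[stage]
            next_stage = stage + 1 if advanced else stage
            # Bonus for mission progress, small penalty for time spent.
            reward = (10.0 if advanced else 0.0) - 0.1 * iters
            target = reward + gamma * max(q[next_stage])
            q[stage][b] += alpha * (target - q[stage][b])
            stage = next_stage
    return q


def greedy_sequence(q):
    """Behavior with the highest learned value at each non-terminal stage."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_sequence(train()))
```

In this sketch the learner never sees `REQUIRED_ORDER` directly; it only observes rewards, which loosely mirrors the setting where mission specifications are uncertain. A multi-robot version would replace the scalar parameter and quadratic cost with the team's controller parameters and task performance cost.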

Publisher

Springer Science and Business Media LLC

Subject

Control and Optimization, Aerospace Engineering, Control and Systems Engineering

Link

https://link.springer.com/content/pdf/10.1007/s11768-021-00069-5.pdf

Reference: 30 articles.

1. Antonelli, G. (2013). Interconnected dynamic systems: An overview on distributed control. IEEE Control Systems Magazine, 33(1), 76–88.

2. Cortés, J., & Egerstedt, M. (2017). Coordinated control of multi-robot systems: A survey. SICE Journal of Control, Measurement, and System Integration, 10(6), 495–503.

3. Oh, K. K., Park, M. C., & Ahn, H. S. (2015). A survey of multi-agent formation control. Automatica, 53, 424–440.

4. Schwager, M., Rus, D., & Slotine, J. J. (2011). Unifying geometric, probabilistic, and potential field approaches to multi-robot deployment. International Journal of Robotics Research, 30(3), 371–383.

5. Li, A., Wang, L., Pierpaoli, P., & Egerstedt, M. (2018). Formally correct composition of coordinated behaviors using control barrier certificates. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 3723–3729. Madrid, Spain.

Cited by 1 article.

1. Dynamic Preference Learning for Complex Cobotic Tasks. IFAC-PapersOnLine, 2023.