Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning

Published: 2020-08-27 Issue: 17 Volume: 20 Page: 4836
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Chen Guangda, Yao Shunyi, Ma Jun, Pan Lifan, Chen Yu’an, Xu Pei, Ji Jianmin, Chen Xiaoping

Abstract

Avoiding obstacles safely and efficiently is challenging for multiple robots of different shapes in distributed, communication-free scenarios, where robots do not communicate with each other and sense only the positions of other robots and the obstacles around them. Most existing multi-robot collision avoidance systems require either communication between robots or expensive movement data of other robots, such as velocities, accelerations and paths. In this paper, we propose a map-based deep reinforcement learning (DRL) approach for multi-robot collision avoidance in a distributed and communication-free environment. We represent the environment around a robot with its egocentric local grid map, which encodes the robot’s own shape and the observable appearance of other robots and obstacles, and which can be easily generated using multiple sensors or sensor fusion. We then apply the distributed proximal policy optimization (DPPO) algorithm to train a convolutional neural network that directly maps three frames of egocentric local grid maps and the robot’s relative local goal positions to low-level robot control commands. Compared to other methods, the map-based approach is more robust to noisy sensor data, does not require robots’ movement data, and accounts for the sizes and shapes of related robots, which makes it more efficient and easier to deploy on real robots. We first train the neural network in a specified simulator of multiple mobile robots using DPPO, with a multi-stage curriculum learning strategy over multiple scenarios to improve performance. We then deploy the trained model to real robots to perform collision avoidance during navigation without tedious parameter tuning. We evaluate the approach in multiple scenarios, both in the simulator and on four differential-drive mobile robots in the real world. Both qualitative and quantitative experiments show that our approach is efficient and outperforms existing DRL-based approaches on many indicators. We also conduct ablation studies showing the positive effects of using egocentric grid maps and multi-stage curriculum learning.
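
The egocentric local grid map mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `egocentric_grid`, the map size, the cell resolution, and the binary occupancy encoding are all assumptions chosen for illustration. The sketch only shows the general idea of rotating world-frame obstacle points into the robot's frame and rasterizing them into a grid centered on the robot.

```python
import math


def egocentric_grid(robot_pose, obstacle_points, size=4.0, resolution=0.5):
    """Rasterize world-frame points into a square grid centered on the robot.

    robot_pose: (x, y, heading_rad) in the world frame.
    obstacle_points: iterable of (x, y) world-frame points.
    size: side length of the map in meters (assumed value).
    resolution: edge length of one cell in meters (assumed value).
    Returns a list of rows; 1 marks an occupied cell, 0 a free one.
    """
    rx, ry, heading = robot_pose
    cells = int(round(size / resolution))
    grid = [[0] * cells for _ in range(cells)]
    cos_h, sin_h = math.cos(heading), math.sin(heading)
    half = size / 2.0

    for wx, wy in obstacle_points:
        dx, dy = wx - rx, wy - ry
        # Rotate the offset into the robot frame: x forward, y to the left.
        ex = cos_h * dx + sin_h * dy
        ey = -sin_h * dx + cos_h * dy
        # Skip points that fall outside the local window.
        if not (-half < ex <= half and -half < ey <= half):
            continue
        # Forward maps toward row 0, left maps toward column 0.
        row = min(cells - 1, int((half - ex) / resolution))
        col = min(cells - 1, int((half - ey) / resolution))
        grid[row][col] = 1
    return grid


# One obstacle 1 m ahead of a robot at the origin, one far outside the window.
grid = egocentric_grid((0.0, 0.0, 0.0), [(1.0, 0.0), (10.0, 10.0)])
```

In a setup like the one the abstract describes, several consecutive grids of this kind (three frames in the paper) would be stacked and passed to the network together with the relative goal position; how the paper encodes robot shapes and cell values is not specified here.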

Funder

Science and Technology Planning Project of Guangdong Province

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/17/4836/pdf

Reference: 61 articles.

1. Master-followed Multiple Robots Cooperation SLAM Adapted to Search and Rescue Environment

2. A mechanism for scheduling multi robot intelligent warehouse system face with dynamic demand

3. Toward Socially Aware Robot Navigation in Dynamic and Crowded Environments: A Proactive Social Motion Model

4. Safe, multi-agent, reinforcement learning for autonomous driving; Shalev-Shwartz; arXiv, 2016

5. Personalized Privacy-Preserving Task Allocation for Mobile Crowdsensing

Cited by 26 articles.

1. PathRL: An End-to-End Path Generation Method for Collision Avoidance via Deep Reinforcement Learning; 2024 IEEE International Conference on Robotics and Automation (ICRA); 2024-05-13

2. A review of perception sensors, techniques, and hardware architectures for autonomous low-altitude UAVs in non-cooperative local obstacle avoidance; Robotics and Autonomous Systems; 2024-03

3. Deep reinforcement learning in mobile robotics – a concise review; Multimedia Tools and Applications; 2024-02-05

4. Feedback-Based Curriculum Learning for Collision Avoidance; IEEE Access; 2024

5. Learning Complicated Navigation Skills from Limited Experience via Augmenting Offline Datasets; 2023 IEEE 35th International Conference on Tools with Artificial Intelligence (ICTAI); 2023-11-06