A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments-Reference-Cited by-同舟云学术

A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments

Published:2023-10-09 Issue: Volume: Page:
ISSN:2468-2322
Container-title:CAAI Transactions on Intelligence Technology
language:en
Short-container-title:CAAI Trans on Intel Tech

Author:

Zhou Zhiqian¹^ORCID,Ren Junkai¹^ORCID,Zeng Zhiwen¹,Xiao Junhao¹,Zhang Xinglong¹^ORCID,Guo Xian²,Zhou Zongtan¹,Lu Huimin¹

Affiliation:

1. College of Intelligence Science and Technology National University of Defense Technology Changsha China

2. Institute of Robotics and Automatic Information System, College of Artificial Intelligence NanKai University Tianjin China

Abstract

AbstractWhen deploying mobile robots in real‐world scenarios, such as airports, train stations, hospitals, and schools, collisions with pedestrians are intolerable and catastrophic. Motion safety becomes one of the most fundamental requirements for mobile robots. However, until now, efficient and safe robot navigation in such dynamic environments is still an open problem. The critical reason is that the inconsistency between navigation efficiency and motion safety is greatly intensified by the high dynamics and uncertainties of pedestrians. To face the challenge, this paper proposes a safe deep reinforcement learning algorithm named Conflict‐Averse Safe Reinforcement Learning (CASRL) for autonomous robot navigation in dynamic environments. Specifically, it first separates the collision avoidance sub‐task from the overall navigation task and maintains a safety critic to evaluate the safety/risk of actions. Later, it constructs two task‐specific but model‐agnostic policy gradients for goal‐reaching and collision avoidance sub‐tasks to eliminate their mutual interference. Then, it further performs a conflict‐averse gradient manipulation to address the inconsistency between two sub‐tasks. Finally, extensive experiments are performed to evaluate the superiority of CASRL. Simulation results show an average 8.2% performance improvement over the vanilla baseline in eight groups of dynamic environments, which is further extended to 13.4% in the most challenging group. Besides, forty real‐world experiments fully illustrated that the CASRL could be successfully deployed on a real robot.

Funder

National Natural Science Foundation of China

Publisher

Institution of Engineering and Technology (IET)

Subject

Artificial Intelligence,Computer Networks and Communications,Computer Vision and Pattern Recognition,Human-Computer Interaction,Information Systems

Reference52 articles.

1. Proactive kinodynamic planning using the Extended Social Force Model and human motion prediction in urban environments

2. Interactive model predictive control for robot navigation in dense crowds;Chen Y.;IEEE Trans. Syst. Man Cybernetics,2021

3. Receding Horizon Control with Trajectron++: Navigating Mobile Robots in the Crowd

4. Unfreezing the robot: Navigation in dense, interacting crowds

5. Robot Navigation in Crowds by Graph Convolutional Networks With Attention Learned From Human Gaze

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing Physical Education: A Dynamic Fuzzy Neural Network-Based Information Processing System Design;IEEE Access;2024