Navigation in Unknown Dynamic Environments Based on Deep Reinforcement Learning

Published:2019-09-05 Issue:18 Volume:19 Page:3837
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Zeng Junjie, Ju Rusheng, Qin Long, Hu Yue, Yin Quanjun, Hu Cong

Abstract

In this paper, we propose a novel Deep Reinforcement Learning (DRL) algorithm that navigates non-holonomic robots with continuous control in an unknown dynamic environment with moving obstacles. We call the approach MK-A3C (Memory and Knowledge-based Asynchronous Advantage Actor-Critic). As its first component, MK-A3C builds a GRU-based memory neural network to enhance the robot's capability for temporal reasoning. Robots without it tend to behave irrationally in the face of incomplete and noisy estimations of complex environments. Additionally, robots with the memory ability endowed by MK-A3C can avoid local minima traps by estimating the environmental model. Secondly, MK-A3C combines a domain knowledge-based reward function with a transfer learning-based training task architecture, which addresses the policy non-convergence problem caused by sparse rewards. With these improvements, MK-A3C can efficiently navigate robots in unknown dynamic environments and satisfy kinetic constraints while handling moving objects. Simulation experiments show that, compared with existing methods, MK-A3C achieves successful robotic navigation in unknown and challenging environments by outputting continuous acceleration commands.
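The abstract names a domain knowledge-based reward function as the remedy for sparse rewards but does not give its form. As a rough illustration of the general idea only, the sketch below adds a dense progress term to the sparse terminal rewards. It is not the paper's reward function: the function name `shaped_reward`, its parameters, and every constant are hypothetical choices made for this example.

```python
import math


def shaped_reward(prev_pos, pos, goal, min_obstacle_dist,
                  goal_radius=0.3, collision_radius=0.2,
                  progress_weight=1.0, step_penalty=0.01):
    """Return (reward, done) for one navigation step.

    Hypothetical shaping, not taken from the paper:
    - terminal penalty when the nearest obstacle is within collision_radius
    - terminal bonus when the robot is within goal_radius of the goal
    - otherwise a dense reward proportional to the distance gained
      toward the goal, minus a small per-step cost
    """
    if min_obstacle_dist <= collision_radius:
        return -1.0, True

    dist = math.hypot(goal[0] - pos[0], goal[1] - pos[1])
    if dist <= goal_radius:
        return 1.0, True

    prev_dist = math.hypot(goal[0] - prev_pos[0], goal[1] - prev_pos[1])
    progress = prev_dist - dist  # positive when the robot moved closer
    return progress_weight * progress - step_penalty, False
```

With only the two terminal cases, most steps would return zero and the agent would get no learning signal until it happened to reach the goal. The progress term gives a non-zero reward on nearly every step, which is the usual motivation for shaping under sparse rewards.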

Funder

National Science Foundation of Hunan Province

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/19/18/3837/pdf

References: 38 articles.

1. A Review of Real-Time Strategy Game AI

2. Algorithms for collision-free navigation of mobile robots in complex cluttered environments: a survey

3. Seeking a path through the crowd: Robot navigation in unknown dynamic environments with moving obstacles based on an integrated environment representation

Cited by 51 articles.

1. Cooperative Path Following Control in Autonomous Vehicles Graphical Games: A Data-Based Off-Policy Learning Approach;IEEE Transactions on Intelligent Transportation Systems;2024-08

2. NavFormer: A Transformer Architecture for Robot Target-Driven Navigation in Unknown and Dynamic Environments;IEEE Robotics and Automation Letters;2024-08

3. Navigation Based on Hybrid Decentralized and Centralized Training and Execution Strategy for Multiple Mobile Robots Reinforcement Learning;Electronics;2024-07-24

4. Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology;2024 International Wireless Communications and Mobile Computing (IWCMC);2024-05-27

5. NaviFormer: A Data-Driven Robot Navigation Approach via Sequence Modeling and Path Planning with Safety Verification;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13