Learning Reward Function with Matching Network for Mapless Navigation-Reference-Cited by-同舟云学术

Learning Reward Function with Matching Network for Mapless Navigation

Published:2020-06-30 Issue:13 Volume:20 Page:3664
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Zhang Qichen,Zhu Meiqiang,Zou Liang,Li Ming,Zhang Yong

Abstract

Deep reinforcement learning (DRL) has been successfully applied in mapless navigation. An important issue in DRL is to design a reward function for evaluating actions of agents. However, designing a robust and suitable reward function greatly depends on the designer’s experience and intuition. To address this concern, we consider employing reward shaping from trajectories on similar navigation tasks without human supervision, and propose a general reward function based on matching network (MN). The MN-based reward function is able to gain the experience by pre-training through trajectories on different navigation tasks and accelerate the training speed of DRL in new tasks. The proposed reward function keeps the optimal strategy of DRL unchanged. The simulation results on two static maps show that the DRL converge with less iterations via the learned reward function than the state-of-the-art mapless navigation methods. The proposed method performs well in dynamic maps with partially moving obstacles. Even when test maps are different from training maps, the proposed strategy is able to complete the navigation tasks without additional training.

Funder

Fundamental Research Funds for the Central Universities

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/13/3664/pdf

Reference55 articles.

1. Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age

2. Playing Atari with Deep Reinforcement Learning;Mnih;arXiv: Learning,2013

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Inspection Robot Navigation Based on Improved TD3 Algorithm;Sensors;2024-04-15

2. Implementation of PID controller and enhanced red deer algorithm in optimal path planning of substation inspection robots;Journal of Field Robotics;2024-04-05

3. New technologies for UAV navigation with real-time pattern recognition;Ain Shams Engineering Journal;2024-03

4. Deep Reinforcement Learning for Mapless Robot Navigation Systems;2023 Latin American Robotics Symposium (LARS), 2023 Brazilian Symposium on Robotics (SBR), and 2023 Workshop on Robotics in Education (WRE);2023-10-09

5. Immune deep reinforcement learning-based path planning for mobile robot in unknown environment;Applied Soft Computing;2023-09