Transformable Gaussian Reward Function for Socially Aware Navigation Using Deep Reinforcement Learning

Published:2024-07-13 Issue:14 Volume:24 Page:4540
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Kim Jinyeob¹, Kang Sumin², Yang Sungwoo², Kim Beomjoon¹, Yura Jargalbaatar², Kim Donghan²

Affiliation:

1. Department of Artificial Intelligence, College of Software, Kyung Hee University, Yongin 17104, Republic of Korea

2. Department of Electronic Engineering (AgeTech-Service Convergence Major), College of Electronics & Information, Kyung Hee University, Yongin 17104, Republic of Korea

Abstract

Robot navigation has transitioned from avoiding static obstacles to adopting socially aware navigation strategies for coexisting with humans. Consequently, socially aware navigation in dynamic, human-centric environments has gained prominence in the field of robotics. Reinforcement learning is one of the methods that has driven progress in socially aware navigation. However, defining appropriate reward functions, particularly in congested environments, poses a significant challenge. These reward functions, which are crucial for guiding robot actions, must be designed by hand because they are complex and cannot be set automatically. Manually designed reward functions suffer from issues such as hyperparameter redundancy, imbalance, and inadequate representation of unique object characteristics. To address these challenges, we introduce a transformable Gaussian reward function (TGRF). The TGRF has two main features. First, it reduces the tuning burden by using a small number of hyperparameters that function independently. Second, it enables the application of various reward functions through its transformability. Consequently, it exhibits high performance and accelerated learning rates within the deep reinforcement learning (DRL) framework. We also validated the performance of the TGRF through simulations and experiments.

Funder

MSI

Publisher

MDPI AG

Link

https://www.mdpi.com/1424-8220/24/14/4540/pdf

References (47 articles)

1. Mobile robot obstacle avoidance via depth from focus;Nourbakhsh;Robot. Auton. Syst.,1997

2. Ulrich, I., and Borenstein, J. (1998, January 20). VFH+: Reliable obstacle avoidance for fast mobile robots. Proceedings of the 1998 IEEE International Conference on Robotics and Automation (Cat. No. 98CH36146), Leuven, Belgium.

3. Stereovision-based fuzzy obstacle avoidance method;Nalpantidis;Int. J. Humanoid Robot.,2011

4. Non-probabilistic cellular automata-enhanced stereo vision simultaneous localization and mapping;Nalpantidis;Meas. Sci. Technol.,2011

5. Pritsker, A.A.B. (1995). Introduction to Simulation and SLAM II, John Wiley & Sons, Inc.