Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning

Published:2022-08-04 Issue:15 Volume:22 Page:5845
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Jang Sooyoung^ORCID,Kim Hyung-Il^ORCID

Abstract

Effective exploration is one of the critical factors affecting performance in deep reinforcement learning. Agents acquire data to learn the optimal policy through exploration, and if it is not guaranteed, the data quality deteriorates, which leads to performance degradation. This study investigates the effect of initial entropy, which significantly influences exploration, especially in the early learning stage. The results of this study on tasks with discrete action space show that (1) low initial entropy increases the probability of learning failure, (2) the distributions of initial entropy for various tasks are biased towards low values that inhibit exploration, and (3) the initial entropy for discrete action space varies with both the initial weight and task, making it hard to control. We then devise a simple yet powerful learning strategy to deal with these limitations, namely, entropy-aware model initialization. The proposed algorithm aims to provide a model with high initial entropy to a deep reinforcement learning algorithm for effective exploration. Our experiments showed that the devised learning strategy significantly reduces learning failures and enhances performance, stability, and learning speed.

Funder

Electronics and Telecommunications Research Institute

Institute for Information and Communications Technology Promotion

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/15/5845/pdf

Reference39 articles.

1. Deep Reinforcement Learning: A Brief Survey

2. Hierarchical Deep Reinforcement Learning for Continuous Action Control

3. Composable Deep Reinforcement Learning for Robotic Manipulation;Haarnoja;Proceedings of the IEEE International Conference on Robotics and Automation (ICRA),2018

4. Neural network based reinforcement learning for audio–visual gaze control in human–robot interaction

5. Prioritized Environment Configuration for Drone Control with Deep Reinforcement Learning;Jang;Hum. Centric Comput. Inf. Sci.,2022

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep learning on medical image analysis;CAAI Transactions on Intelligence Technology;2024-06-24

2. Real-Time Scheduling of Pumps in Water Distribution Systems Based on Exploration-Enhanced Deep Reinforcement Learning;Systems;2023-01-20