Learning Representations in Model-Free Hierarchical Reinforcement Learning

Published:2019-07-17 Issue: Volume:33 Page:10009-10010
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Jacob Rafati, David C. Noelle

Abstract

Common approaches to Reinforcement Learning (RL) are seriously challenged by large-scale applications involving huge state spaces and sparse delayed reward feedback. Hierarchical Reinforcement Learning (HRL) methods attempt to address this scalability issue by learning action selection policies at multiple levels of temporal abstraction. Abstraction can be had by identifying a relatively small set of states that are likely to be useful as subgoals, in concert with the learning of corresponding skill policies to achieve those subgoals. Many approaches to subgoal discovery in HRL depend on the analysis of a model of the environment, but the need to learn such a model introduces its own problems of scale. Once subgoals are identified, skills may be learned through intrinsic motivation, introducing an internal reward signal marking subgoal attainment. We present a novel model-free method for subgoal discovery using incremental unsupervised learning over a small memory of the most recent experiences of the agent. When combined with an intrinsic motivation learning mechanism, this method learns subgoals and skills together, based on experiences in the environment. Thus, we offer an original approach to HRL that does not require the acquisition of a model of the environment, suitable for large-scale applications. We demonstrate the efficiency of our method on a variant of the rooms environment.
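As a rough illustration of the idea described in the abstract, the sketch below clusters a small memory of recent states incrementally and treats the cluster centroids as candidate subgoals, with a binary intrinsic reward marking subgoal attainment. The abstract only says "incremental unsupervised learning", so the choice of online k-means, the tuple state representation, and all names (`IncrementalKMeans`, `discover_subgoals`, `intrinsic_reward`, `tol`) are assumptions made for this example, not the authors' implementation.

```python
import random
from collections import deque


class IncrementalKMeans:
    """Online k-means (an assumed stand-in for the paper's unsupervised learner)."""

    def __init__(self, k, dim, seed=0):
        rng = random.Random(seed)
        # Hypothetical initialisation: random centroids in the unit hypercube.
        self.centroids = [[rng.random() for _ in range(dim)] for _ in range(k)]
        self.counts = [0] * k

    def nearest(self, x):
        """Index of the centroid closest to x (squared Euclidean distance)."""
        dists = [sum((c - a) ** 2 for c, a in zip(cen, x)) for cen in self.centroids]
        return min(range(len(dists)), key=dists.__getitem__)

    def update(self, x):
        """Move the nearest centroid toward x; the step size shrinks as 1/count."""
        j = self.nearest(x)
        self.counts[j] += 1
        lr = 1.0 / self.counts[j]
        self.centroids[j] = [(1.0 - lr) * c + lr * a
                             for c, a in zip(self.centroids[j], x)]
        return j


def discover_subgoals(memory, clusterer):
    """Feed every state in the recent-experience memory to the clusterer
    and return the current centroids as candidate subgoals."""
    for state in memory:
        clusterer.update(state)
    return [tuple(c) for c in clusterer.centroids]


def intrinsic_reward(state, subgoal, tol=0.5):
    """Internal reward: 1.0 when the state lies within tol of the subgoal."""
    dist2 = sum((s - g) ** 2 for s, g in zip(state, subgoal))
    return 1.0 if dist2 <= tol ** 2 else 0.0


if __name__ == "__main__":
    # A small memory of the most recent states; older entries are dropped.
    memory = deque(maxlen=4)
    for s in [(0.0, 0.0), (10.0, 10.0), (0.0, 2.0), (10.0, 8.0)]:
        memory.append(s)
    subgoals = discover_subgoals(memory, IncrementalKMeans(k=2, dim=2))
    print(sorted(subgoals))
```

No model of the environment's transitions is built here: the clusterer only sees states the agent has actually visited, which is the model-free property the abstract emphasises. A lower-level skill policy would be trained on `intrinsic_reward`, while the higher-level policy chooses among the discovered subgoals.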

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 18 articles.

1. Work Together to Keep Fresh: Hierarchical Learning for UAVs-assisted Data Time-Sensitive IoT;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

2. A goal-oriented reinforcement learning for optimal drug dosage control;Annals of Operations Research;2024-05-09

3. Past Data-Driven Adaptation in Hierarchical Reinforcement Learning;Proceedings of the 2024 16th International Conference on Machine Learning and Computing;2024-02-02

4. HC-API: A Hierarchical Collaborative Agent Permutation Invariant Framework for Multi-agent Reinforcement Learning;Lecture Notes in Electrical Engineering;2024

5. Towards efficient long-horizon decision-making using automated structure search method of hierarchical reinforcement learning for edge artificial intelligence;Internet of Things;2023-12