Affiliation:
1. School of EECS, Washington State University
Abstract
Reinforcement learning has enjoyed multiple impressive successes in recent years. However, these successes typically require very large amounts of data before an agent achieves acceptable performance. This paper focuses on a novel way of reducing these data requirements by leveraging existing (human or agent) knowledge. In particular, it uses demonstrations to allow an agent to quickly achieve high performance. We introduce the Dynamic Reuse of Prior (DRoP) algorithm, which combines offline knowledge (demonstrations recorded before learning) with an online, confidence-based performance analysis. DRoP leverages the demonstrator's knowledge by automatically balancing between reusing the prior knowledge and following the current learned policy, allowing the agent to outperform the original demonstrations. We compare DRoP with multiple state-of-the-art learning algorithms and empirically show that it achieves superior performance in two domains. Additionally, we show that the confidence measure can be used to selectively request additional demonstrations, significantly improving the agent's learning performance.
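The abstract describes the core mechanism (per-state, confidence-based arbitration between a demonstration-derived prior and the agent's own learned policy) but not its exact form. The sketch below is a minimal illustration of that idea only: the class name, the moving-average confidence update, and all parameter names are assumptions for illustration, not the paper's actual method.

```python
import random
from collections import defaultdict

class ConfidenceReuseAgent:
    """Hypothetical sketch: Q-learning that reuses a demonstration policy
    while an online confidence estimate favors it (in the spirit of DRoP)."""

    def __init__(self, actions, prior_policy, alpha=0.1, gamma=0.95, epsilon=0.1):
        self.actions = actions
        self.prior_policy = prior_policy        # state -> action, built from demonstrations
        self.q = defaultdict(float)             # learned action values, keyed by (state, action)
        self.conf_prior = defaultdict(float)    # running confidence in the prior, per state
        self.conf_self = defaultdict(float)     # running confidence in the learned policy, per state
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def greedy(self, state):
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def act(self, state):
        # Reuse the demonstrator's action while its confidence dominates;
        # otherwise fall back to the epsilon-greedy learned policy.
        if self.conf_prior[state] >= self.conf_self[state]:
            return self.prior_policy(state), "prior"
        if random.random() < self.epsilon:
            return random.choice(self.actions), "self"
        return self.greedy(state), "self"

    def update(self, state, action, reward, next_state, source):
        # Standard Q-learning update for the learned policy.
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])
        # Assumed confidence update: an exponential moving average of the
        # TD target observed when following each knowledge source.
        if source == "prior":
            self.conf_prior[state] += self.alpha * (td_target - self.conf_prior[state])
        else:
            self.conf_self[state] += self.alpha * (td_target - self.conf_self[state])
```

A training loop would call act, step the environment, and then call update with the returned source tag, so that confidence in each knowledge source tracks the returns actually observed when following it; once the learned policy's confidence overtakes the prior's in a state, the agent stops deferring to the demonstration there, which is how it can eventually outperform the original demonstrations.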
Publisher
International Joint Conferences on Artificial Intelligence Organization
Cited by
11 articles.