Learning to Teach Reinforcement Learning Agents-Reference-Cited by-同舟云学术

Learning to Teach Reinforcement Learning Agents

Published:2017-12-06 Issue:1 Volume:1 Page:21-42
ISSN:2504-4990
Container-title:Machine Learning and Knowledge Extraction
language:en
Short-container-title:MAKE

Author:

Fachantidis Anestis,Taylor Matthew^ORCID,Vlahavas Ioannis

Abstract

In this article, we study the transfer learning model of action advice under a budget. We focus on reinforcement learning teachers providing action advice to heterogeneous students playing the game of Pac-Man under a limited advice budget. First, we examine several critical factors affecting advice quality in this setting, such as the average performance of the teacher, its variance and the importance of reward discounting in advising. The experiments show that the best performers are not always the best teachers and reveal the non-trivial importance of the coefficient of variation (CV) as a statistic for choosing policies that generate advice. The CV statistic relates variance to the corresponding mean. Second, the article studies policy learning for distributing advice under a budget. Whereas most methods in the relevant literature rely on heuristics for advice distribution, we formulate the problem as a learning one and propose a novel reinforcement learning algorithm capable of learning when to advise or not. The proposed algorithm is able to advise even when it does not have knowledge of the student’s intended action and needs significantly less training time compared to previous learning approaches. Finally, in this article, we argue that learning to advise under a budget is an instance of a more generic learning problem: Constrained Exploitation Reinforcement Learning.

Publisher

MDPI AG

Subject

General Economics, Econometrics and Finance

Link

http://www.mdpi.com/2504-4990/1/1/2/pdf

Reference25 articles.

1. Reinforcement Learning, An Introduction;Sutton,1998

2. Transfer Learning for Reinforcement Learning Domains: A Survey;Taylor;J. Mach. Learn. Res.,2009

3. Transfer in reinforcement learning: A framework and a survey;Lazaric,2012

Cited by 35 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A location-based advising method in teacher–student frameworks;Knowledge-Based Systems;2024-02

2. Robust multi-agent reinforcement learning via Bayesian distributional value estimation;Pattern Recognition;2024-01

3. Explainable Action Advising for Multi-Agent Reinforcement Learning;2023 IEEE International Conference on Robotics and Automation (ICRA);2023-05-29

4. Location-Based Real-Time Updated Advising Method for Traffic Signal Control;IEEE Internet of Things Journal;2023

5. Learning by reusing previous advice: a memory-based teacher–student framework;Autonomous Agents and Multi-Agent Systems;2022-12-29