Accelerating Reinforcement Learning through Implicit Imitation-Reference-Cited by-同舟云学术

Accelerating Reinforcement Learning through Implicit Imitation

Published:2003-12-01 Issue: Volume:19 Page:569-629
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Price B.,Boutilier C.

Abstract

Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent's ability to learn useful behaviors by making intelligent use of the knowledge implicit in behaviors demonstrated by cooperative teachers or other more experienced agents. We propose and study a formal model of implicit imitation that can accelerate reinforcement learning dramatically in certain cases. Roughly, by observing a mentor, a reinforcement-learning agent can extract information about its own capabilities in, and the relative value of, unvisited parts of the state space. We study two specific instantiations of this model, one in which the learning agent and the mentor have identical abilities, and one designed to deal with agents and mentors with different action sets. We illustrate the benefits of implicit imitation by integrating it with prioritized sweeping, and demonstrating improved performance and convergence through observation of single and multiple mentors. Though we make some stringent assumptions regarding observability and possible interactions, we briefly comment on extensions of the model that relax these restricitions.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 82 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An incremental learning approach to dynamic parallel machine scheduling with sequence-dependent setups and machine eligibility restrictions;Applied Soft Computing;2024-10

2. Reinforcement Learning Review: Past Acts, Present Facts and Future Prospects;IT Journal Research and Development;2024-02-15

3. A decision-making of autonomous driving method based on DDPG with pretraining;Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering;2024-01-29

4. Multi-Agent Reinforcement Learning for Traffic Flow Management of Autonomous Vehicles;Sensors;2023-02-21

5. Implicit Continuous User Authentication for Mobile Devices based on Deep Reinforcement Learning;Computer Systems Science and Engineering;2023