Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents-Reference-Cited by-同舟云学术

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents

Published:2018-03-19 Issue: Volume:61 Page:523-562
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Machado Marlos C.,Bellemare Marc G.,Talvitie Erik,Veness Joel,Hausknecht Matthew,Bowling Michael

Abstract

The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge of building AI agents with general competency across dozens of Atari 2600 games. It supports a variety of different problem settings and it has been receiving increasing attention from the scientific community, leading to some high-profile success stories such as the much publicized Deep Q-Networks (DQN). In this article we take a big picture look at how the ALE is being used by the research community. We show how diverse the evaluation methodologies in the ALE have become with time, and highlight some key concerns when evaluating agents in the ALE. We use this discussion to present some methodological best practices and provide new benchmark results using these best practices. To further the progress in the field, we introduce a new version of the ALE that supports multiple game modes and provides a form of stochasticity we call sticky actions. We conclude this big picture look by revisiting challenges posed when the ALE was introduced, summarizing the state-of-the-art in various problems and highlighting problems that remain open.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 94 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Addressing maximization bias in reinforcement learning with two-sample testing;Artificial Intelligence;2024-11

2. A multi-step on-policy deep reinforcement learning method assisted by off-policy policy evaluation;Applied Intelligence;2024-09-09

3. Searching for a Diversity of Interpretable Graph Control Policies;Proceedings of the Genetic and Evolutionary Computation Conference;2024-07-14

4. Distributional Reinforcement Learning with Sample-set Bellman Update;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

5. Investigating the properties of neural network representations in reinforcement learning;Artificial Intelligence;2024-05