Universal Reinforcement Learning Algorithms: Survey and Experiments-Reference-Cited by-同舟云学术

Universal Reinforcement Learning Algorithms: Survey and Experiments

Published:2017-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Aslanides John¹,Leike Jan²,Hutter Marcus¹

Affiliation:

1. Australian National University

2. Future of Humanity Institute, University of Oxford

Abstract

Many state-of-the-art reinforcement learning (RL) algorithms typically assume that the environment is an ergodic Markov Decision Process (MDP). In contrast, the field of universal reinforcement learning (URL) is concerned with algorithms that make as few assumptions as possible about the environment. The universal Bayesian agent AIXI and a family of related URL algorithms have been developed in this setting. While numerous theoretical optimality results have been proven for these agents, there has been no empirical investigation of their behavior to date. We present a short and accessible survey of these URL algorithms under a unified notation and framework, along with results of some experiments that qualitatively illustrate some properties of the resulting policies, and their relative performance on partially-observable gridworld environments. We also present an open- source reference implementation of the algorithms which we hope will facilitate further understanding of, and experimentation with, these ideas.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. How to Design Reinforcement Learning Methods for the Edge: An Integrated Approach toward Intelligent Decision Making;Electronics;2024-03-29

2. Offline Policy Comparison Under Limited Historical Agent-Environment Interactions;Lecture Notes in Computational Science and Engineering;2024

3. AIXI, FEP-AI, and Integrated World Models: Towards a Unified Understanding of Intelligence and Consciousness;Active Inference;2023

4. Computing Complexity-aware Plans Using Kolmogorov Complexity;2021 60th IEEE Conference on Decision and Control (CDC);2021-12-14

5. A Review of Supportive Computational Approaches for Neurological Disorder Identification;Interdisciplinary Approaches to Altering Neurodevelopmental Disorders;2020