Distributional Reinforcement Learning With Quantile Regression

Published: 2018-04-29 Issue: 1 Volume: 32
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Dabney Will, Rowland Mark, Bellemare Marc, Munos Rémi

Abstract

In reinforcement learning (RL), an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the observed long-term return. Traditionally, reinforcement learning algorithms average over this randomness to estimate the value function. In this paper, we build on recent work advocating a distributional approach to reinforcement learning in which the distribution over returns is modeled explicitly instead of only estimating the mean. That is, we examine methods of learning the value distribution instead of the value function. We give results that close a number of gaps between the theoretical and algorithmic results given by Bellemare, Dabney, and Munos (2017). First, we extend existing results to the approximate distribution setting. Second, we present a novel distributional reinforcement learning algorithm consistent with our theoretical formulation. Finally, we evaluate this new algorithm on the Atari 2600 games, observing that it significantly outperforms many of the recent improvements on DQN, including the related distributional algorithm C51.
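The distinction the abstract draws between estimating only the mean return and modeling the return distribution can be illustrated with a toy sketch. This is not the paper's algorithm: it is a minimal, assumed example in plain Python that fits a handful of quantile estimates to a fixed set of sampled returns by subgradient steps on the quantile-regression (pinball) loss. The function name `fit_quantiles`, the bimodal toy returns, and all hyperparameters are hypothetical choices made for illustration.

```python
import random


def fit_quantiles(samples, n_quantiles=5, lr=0.01, epochs=200, seed=0):
    """Fit n_quantiles quantile estimates to `samples` with quantile regression.

    Each estimate theta_i targets the quantile level tau_i (a midpoint of
    n_quantiles equal-probability bins) and is updated by a subgradient step
    on the pinball loss rho_tau(u) = u * (tau - 1[u < 0]), with u = g - theta.
    """
    rng = random.Random(seed)
    taus = [(2 * i + 1) / (2 * n_quantiles) for i in range(n_quantiles)]
    thetas = [0.0] * n_quantiles
    for _ in range(epochs):
        # Visit the sampled returns in a fresh random order each epoch.
        for g in rng.sample(samples, len(samples)):
            for i, tau in enumerate(taus):
                below = 1.0 if g < thetas[i] else 0.0
                # Move up with weight tau, down with weight (1 - tau).
                thetas[i] += lr * (tau - below)
    return taus, thetas


# Toy returns (an assumption): half the episodes return 0, half return 10.
returns = [0.0] * 50 + [10.0] * 50

# A value-function style summary keeps only the average.
mean_return = sum(returns) / len(returns)

# A distributional summary keeps several quantiles of the same returns.
taus, thetas = fit_quantiles(returns)

print("mean return:", mean_return)
for tau, theta in zip(taus, thetas):
    print(f"tau={tau:.1f}  quantile estimate={theta:.2f}")
```

In this toy case the mean sits between the two modes, a return that never actually occurs, while the low and high quantile estimates settle near the two observed outcomes. The median estimate is not uniquely determined here, since any value between the two modes minimizes its loss.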

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 149 articles.

1. Adaptive pessimism via target Q-value for offline reinforcement learning;Neural Networks;2024-12

2. Reinforcement learning for electric vehicle charging scheduling: A systematic review;Transportation Research Part E: Logistics and Transportation Review;2024-10

3. Attention-Based Distributional Reinforcement Learning for Safe and Efficient Autonomous Driving;IEEE Robotics and Automation Letters;2024-09

4. Adversarial robustness of deep reinforcement learning-based intrusion detection;International Journal of Information Security;2024-08-29

5. Combining transformer based deep reinforcement learning with Black-Litterman model for portfolio optimization;Neural Computing and Applications;2024-08-10