Risk-Sensitive Policy with Distributional Reinforcement Learning

Published: 2023-06-30 Issue: 7 Volume: 16 Page: 325
ISSN: 1999-4893
Container-title: Algorithms
Language: en
Short-container-title: Algorithms

Author:

Théate Thibaut¹, Ernst Damien¹,²

Affiliation:

1. Department of Electrical Engineering and Computer Science, University of Liège, 4031 Liège, Belgium

2. Information Processing and Communications Laboratory, Institut Polytechnique de Paris, 91120 Paris, France

Abstract

Classical reinforcement learning (RL) techniques are generally concerned with the design of decision-making policies driven by the maximisation of the expected outcome. Nevertheless, this approach does not take into consideration the potential risk associated with the actions taken, which may be critical in certain applications. To address that issue, the present research work introduces a novel methodology based on distributional RL to derive sequential decision-making policies that are sensitive to the risk, the latter being modelled by the tail of the return probability distribution. The core idea is to replace the Q function generally standing at the core of learning schemes in RL by another function, taking into account both the expected return and the risk. Named the risk-based utility function U, it can be extracted from the random return distribution Z naturally learnt by any distributional RL algorithm. This enables the spanning of the complete potential trade-off between risk minimisation and expected return maximisation, in contrast to fully risk-averse methodologies. Fundamentally, this research yields a truly practical and accessible solution for learning risk-sensitive policies with minimal modification to the distributional RL algorithm, with an emphasis on the interpretability of the resulting decision-making process.
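The idea of acting greedily on a risk-based utility U rather than on Q can be illustrated with a minimal sketch. The abstract does not give the formula for U, so the form used here is an assumption: a convex combination of the expected return and a tail statistic (the mean of the worst `rho` fraction of returns), weighted by a trade-off parameter `alpha`. The function names, the parameters `alpha` and `rho`, and the toy return samples are all hypothetical and chosen only for illustration.

```python
import numpy as np

def risk_based_utility(quantiles, alpha=0.5, rho=0.1):
    """Assumed form of U: alpha * E[Z] + (1 - alpha) * tail mean of Z.

    quantiles: array of shape (n_actions, n_samples) holding equally
    weighted samples (e.g. quantile estimates) of the return Z per action.
    """
    q = np.sort(np.asarray(quantiles, dtype=float), axis=1)
    # Number of samples in the lower tail (at least one).
    k = max(1, int(np.ceil(rho * q.shape[1])))
    expected_return = q.mean(axis=1)       # plays the role of Q
    tail_risk = q[:, :k].mean(axis=1)      # mean of the worst outcomes
    return alpha * expected_return + (1.0 - alpha) * tail_risk

def greedy_action(quantiles, alpha=0.5, rho=0.1):
    """Pick the action maximising the assumed utility U instead of Q."""
    return int(np.argmax(risk_based_utility(quantiles, alpha, rho)))

# Hypothetical return samples for two actions.
z = np.array([
    [1.0, 1.0, 1.0, 1.0],    # safe action: lower mean, no bad tail
    [-4.0, 2.0, 3.0, 4.0],   # risky action: higher mean, bad tail
])

print(greedy_action(z, alpha=1.0, rho=0.25))  # risk-neutral choice
print(greedy_action(z, alpha=0.5, rho=0.25))  # risk-sensitive choice
```

Under these assumptions, `alpha = 1` recovers the usual expected-return criterion, while smaller values of `alpha` give more weight to the tail, which is one way to span the trade-off between risk minimisation and expected return maximisation described in the abstract.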

Funder

Thibaut Théate

Publisher

MDPI AG

Subject

Computational Mathematics, Computational Theory and Mathematics, Numerical Analysis, Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/16/7/325/pdf

References: 28 articles.

1. Sutton, R.S., and Barto, A.G. (2018). Reinforcement Learning: An Introduction, MIT Press.

2. Watkins (1992). Technical Note: Q-Learning, Mach. Learn.

3. Levine (2021). Challenges of real-world reinforcement learning: Definitions, benchmarks and analysis, Mach. Learn.

4. Gottesman (2019). Guidelines for reinforcement learning in healthcare, Nat. Med.

5. Ernst (2021). An application of deep reinforcement learning to algorithmic trading, Expert Syst. Appl.

Cited by 2 articles.

1. Dopamine neurons encode a multidimensional probabilistic map of future reward;2023-11-13

2. Offline reinforcement learning in high-dimensional stochastic environments;Neural Computing and Applications;2023-10-11