Reinforcement learning with dynamic convex risk measures-Reference-Cited by-同舟云学术

Reinforcement learning with dynamic convex risk measures

Published:2023-04-17 Issue: Volume: Page:
ISSN:0960-1627
Container-title:Mathematical Finance
language:en
Short-container-title:Mathematical Finance

Author:

Coache Anthony¹,Jaimungal Sebastian¹²^ORCID

Affiliation:

1. Department of Statistical Sciences University of Toronto Toronto Canada

2. Oxford‐Man Institute University of Oxford Oxford United Kingdom

Abstract

AbstractWe develop an approach for solving time‐consistent risk‐sensitive stochastic optimization problems using model‐free reinforcement learning (RL). Specifically, we assume agents assess the risk of a sequence of random variables using dynamic convex risk measures. We employ a time‐consistent dynamic programming principle to determine the value of a particular policy, and develop policy gradient update rules that aid in obtaining optimal policies. We further develop an actor–critic style algorithm using neural networks to optimize over policies. Finally, we demonstrate the performance and flexibility of our approach by applying it to three optimization problems: statistical arbitrage trading strategies, financial hedging, and obstacle avoidance robot control.

Funder

Natural Sciences and Engineering Research Council of Canada

Publisher

Wiley

Subject

Applied Mathematics,Economics and Econometrics,Social Sciences (miscellaneous),Finance,Accounting

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/mafi.12388

Reference67 articles.

1. Acciaio B. &Penner I.(2011).Dynamic risk measures. InAdvanced mathematical methods for finance(pp. 1–34). Springer.

2. On the theory of policy gradient methods: Optimality, approximation, and distribution shift;Agarwal A.;Journal of Machine Learning Research,2021

3. Constrained Risk-Averse Markov Decision Processes

4. Al‐Aradi A. Correia A. Naiff D. Jardim G. &Saporito Y.(2018).Solving nonlinear and high‐dimensional partial differential equations via deep learning.arXiv preprint arXiv:1811.08782.

5. Coherent Measures of Risk

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Risk-Averse Markov Decision Processes Through a Distributional Lens;Mathematics of Operations Research;2024-07-17

2. Optimal dynamic fixed-mix portfolios based on reinforcement learning with second order stochastic dominance;Engineering Applications of Artificial Intelligence;2024-07

3. Markov decision processes with risk-sensitive criteria: an overview;Mathematical Methods of Operations Research;2024-04

4. CVA Hedging with Reinforcement Learning;4th ACM International Conference on AI in Finance;2023-11-25

5. Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning;SIAM Journal on Financial Mathematics;2023-11-14