Robust Reinforcement Learning: A Review of Foundations and Recent Advances

Published: 2022-03-19 Issue: 1 Volume: 4 Page: 276-315
ISSN: 2504-4990
Container-title: Machine Learning and Knowledge Extraction
Language: en
Short-container-title: MAKE

Author:

Moos Janosch, Hansel Kay, Abdulsamad Hany, Stark Svenja, Clever Debora, Peters Jan

Abstract

Reinforcement learning (RL) has become a highly successful framework for learning in Markov decision processes (MDPs). Due to the adoption of RL in realistic and complex environments, solution robustness becomes an increasingly important aspect of RL deployment. Nevertheless, current RL algorithms struggle with robustness to uncertainty, disturbances, or structural changes in the environment. We survey the literature on robust approaches to reinforcement learning and group these methods into four categories: (i) Transition robust designs account for uncertainties in the system dynamics by manipulating the transition probabilities between states; (ii) Disturbance robust designs leverage external forces to model uncertainty in the system behavior; (iii) Action robust designs redirect transitions of the system by corrupting an agent’s output; (iv) Observation robust designs exploit or distort the perceived system state of the policy. Each of these robust designs alters a different aspect of the MDP. Additionally, we address the connection of robustness to the risk-based and entropy-regularized RL formulations. The resulting survey covers all fundamental concepts underlying the approaches to robust reinforcement learning and their recent advances.
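
As a rough illustration of where each of the four robust designs named in the abstract acts on an MDP, the following minimal Python sketch uses a toy five-state chain. It is not taken from the paper: the chain environment, the function `perturbed_step`, and the parameters `slip`, `push`, `action_flip`, and `obs_shift` are all hypothetical names chosen for this example, and each perturbation is fixed rather than chosen adversarially.

```python
import random

N_STATES = 5  # hypothetical chain MDP with states 0..4; state 4 is the goal


def clip(state):
    """Keep a state index inside the chain."""
    return min(max(state, 0), N_STATES - 1)


def perturbed_step(state, policy, rng, *, slip=0.1, push=0,
                   action_flip=0.0, obs_shift=0):
    """One environment step with four optional perturbation hooks (illustrative only)."""
    # (iv) Observation robustness: the policy sees a distorted state.
    observed = clip(state + obs_shift)
    action = policy(observed)  # expected to return -1 or +1

    # (iii) Action robustness: the agent's output is corrupted.
    if rng.random() < action_flip:
        action = -action

    # (i) Transition robustness: the transition probabilities change
    # (here, the probability of slipping and staying in place).
    if rng.random() < slip:
        next_state = state
    else:
        next_state = clip(state + action)

    # (ii) Disturbance robustness: an external force shifts the outcome.
    next_state = clip(next_state + push)

    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward


rng = random.Random(0)
always_right = lambda observed: 1
print(perturbed_step(2, always_right, rng, slip=0.0))  # -> (3, 0.0)
```

In a robust formulation, an adversary or an uncertainty set would typically select these perturbations to minimize the agent's return; the sketch only shows which part of the step each category touches.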

Publisher

MDPI AG

Subject

General Economics, Econometrics and Finance

Link

https://www.mdpi.com/2504-4990/4/1/13/pdf

References: 162 articles.

1. Reinforcement Learning: An Introduction;Sutton,2018

2. Markov Decision Processes: Discrete Stochastic Dynamic Programming;Puterman,2014

3. Feedback Control of Dynamic Systems;Franklin,1994

4. A brief history of automatic control;Bennett;IEEE Control Syst. Mag.,1996

5. Optimal control-1950 to 1985

Cited by 37 articles.

1. A proximal policy optimization approach for food delivery problem with reassignment due to order cancellation;Expert Systems with Applications;2024-12

2. Reinforcement learning for HVAC control in intelligent buildings: A technical and conceptual review;Journal of Building Engineering;2024-10

3. Tolerance of Reinforcement Learning Controllers Against Deviations in Cyber Physical Systems;Lecture Notes in Computer Science;2024-09-13

4. Robuste Lernmethoden bei Unsicherheiten im Zustandsraum;maschinenbau;2024-08

5. Variational quantum circuit learning-enabled robust optimization for AI data center energy control and decarbonization;Advances in Applied Energy;2024-07