Risk-Aware Model-Based Control

Published: 2021-03-11 Volume: 8
ISSN: 2296-9144
Container-title: Frontiers in Robotics and AI
Short-container-title: Front. Robot. AI

Author:

Yu Chen, Rosendo Andre

Abstract

Model-Based Reinforcement Learning (MBRL) algorithms have been shown to have an advantage in data efficiency, but they are often overshadowed in performance by state-of-the-art model-free methods, especially on high-dimensional and complex problems. This work proposes a novel MBRL method called Risk-Aware Model-Based Control (RAMCO). It combines uncertainty-aware deep dynamics models with the risk assessment technique Conditional Value at Risk (CVaR). This mechanism is appropriate for real-world applications because it takes epistemic risk into consideration. In addition, we use a model-free solver to produce warm-up training data; this setting improves performance in low-dimensional environments and compensates for the inherent shortcomings of MBRL in high-dimensional scenarios. In comparison with other state-of-the-art reinforcement learning algorithms, we show that it produces superior results on a walking robot model. We also evaluate the method in an Eidos environment, a novel experimental approach that uses multi-dimensional, randomly initialized deep neural networks to measure the performance of any reinforcement learning algorithm, and the results highlight the advantages of RAMCO.
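
The abstract does not say how CVaR is applied to the model predictions, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that an ensemble of dynamics models has already produced one predicted return per model for each candidate action sequence, and it scores each candidate by the mean of its worst `alpha` fraction of predictions. The names `cvar`, `pick_action_sequence`, `predicted_returns`, and `alpha` are hypothetical.

```python
import numpy as np


def cvar(returns, alpha=0.2):
    """Lower-tail CVaR: mean of the worst alpha-fraction of sampled returns."""
    returns = np.sort(np.asarray(returns, dtype=float))  # ascending, worst first
    k = max(1, int(np.ceil(alpha * returns.size)))       # size of the tail
    return returns[:k].mean()


def pick_action_sequence(predicted_returns, alpha=0.2):
    """Choose the candidate with the best worst-case (CVaR) score.

    predicted_returns: array of shape (n_candidates, n_models), where each
    row holds the return predicted by every ensemble member for one
    candidate action sequence (an assumed layout for illustration).
    """
    scores = np.array([cvar(row, alpha) for row in predicted_returns])
    return int(np.argmax(scores)), scores


# Candidate 0 has the higher mean, but one model predicts a severe failure.
predicted_returns = np.array([
    [12.0, 12.0, 12.0, -20.0, 12.0],
    [5.0, 5.0, 5.0, 4.0, 5.0],
])
best, scores = pick_action_sequence(predicted_returns, alpha=0.2)
print(best, scores)
```

In this toy input, ranking by the mean return would prefer the first candidate, while ranking by CVaR prefers the second, because disagreement among the models is treated as a risk signal.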

Publisher

Frontiers Media SA

Subject

Artificial Intelligence, Computer Science Applications

References: 69 articles.

1. Using inaccurate models in reinforcement learning;Abbeel,2006

2. Analysis of a natural gradient algorithm on monotonic convex-quadratic-composite functions;Akimoto,2012

3. Online model selection for restricted covariance matrix adaptation;Akimoto

4. Projection-based restricted covariance matrix adaptation for high dimension;Akimoto

5. Distributed distributional deterministic policy gradients;Barth-Maron,2018

Cited by 2 articles.

1. Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading;Frontiers in Robotics and AI;2022-05-31

2. Improving Model-Based Deep Reinforcement Learning with Learning Degree Networks and Its Application in Robot Control;Journal of Robotics;2022-03-04