Model-Based Reinforcement Learning in Continuous Environments Using Real-Time Constrained Optimization

Published:2015-02-21 Issue:1 Volume:29
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Andersson Olov, Heintz Fredrik, Doherty Patrick

Abstract

Reinforcement learning for robot control tasks in continuous environments is a challenging problem due to the dimensionality of the state and action spaces, time and resource costs for learning with a real robot as well as constraints imposed for its safe operation. In this paper we propose a model-based reinforcement learning approach for continuous environments with constraints. The approach combines model-based reinforcement learning with recent advances in approximate optimal control. This results in a bounded-rationality agent that makes decisions in real-time by efficiently solving a sequence of constrained optimization problems on learned sparse Gaussian process models. Such a combination has several advantages. No high-dimensional policy needs to be computed or stored while the learning problem often reduces to a set of lower-dimensional models of the dynamics. In addition, hard constraints can easily be included and objectives can also be changed in real-time to allow for multiple or dynamic tasks. The efficacy of the approach is demonstrated on both an extended cart pole domain and a challenging quadcopter navigation task using real data.
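The decision loop described in the abstract (repeatedly solving a constrained optimization problem on a dynamics model and acting on the result) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: a known 1-D double integrator stands in for the learned sparse Gaussian process model, projected gradient descent with finite-difference gradients stands in for whatever solver the authors use, the only hard constraint is a bound on the action, and the names `model`, `cost`, `plan`, and `run` are hypothetical.

```python
DT = 0.1  # assumed control period in seconds


def model(state, u):
    """Stand-in for a learned dynamics model: a 1-D double integrator."""
    pos, vel = state
    return (pos + DT * vel, vel + DT * u)


def cost(state, actions, target):
    """Predicted cost of rolling an action sequence through the model."""
    total = 0.0
    for u in actions:
        state = model(state, u)
        pos, vel = state
        total += (pos - target) ** 2 + 0.1 * vel ** 2 + 0.01 * u ** 2
    return total


def plan(state, target, horizon=10, u_max=1.0, iters=30, step=1.0, eps=1e-4):
    """Projected gradient descent over a finite-horizon action sequence."""
    actions = [0.0] * horizon
    for _ in range(iters):
        base = cost(state, actions, target)
        grad = []
        for i in range(horizon):
            bumped = list(actions)
            bumped[i] += eps
            grad.append((cost(state, bumped, target) - base) / eps)
        # Gradient step, then projection onto the hard bound |u| <= u_max.
        actions = [max(-u_max, min(u_max, u - step * g))
                   for u, g in zip(actions, grad)]
    return actions


def run(start, target, steps=80, u_max=1.0):
    """Receding-horizon loop: re-plan every step, apply only the first action."""
    state, applied = start, []
    for _ in range(steps):
        u = plan(state, target, u_max=u_max)[0]
        applied.append(u)
        state = model(state, u)  # the real system is assumed to match the model
    return state, applied


if __name__ == "__main__":
    final_state, applied = run((0.0, 0.0), target=1.0)
    print(final_state, max(abs(u) for u in applied))
```

No policy is stored in this sketch: each action comes from a fresh optimization at the current state. Because `target` is an ordinary argument to `plan`, the objective can be changed between steps, which loosely mirrors the abstract's point about changing objectives in real time.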

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 3 articles.

1. Controlling FES of arm movements using physics-informed reinforcement learning via co-kriging adjustment;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

2. Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty;SSRN Electronic Journal;2024

3. Integrating Scientific Knowledge with Machine Learning for Engineering and Environmental Systems;ACM Computing Surveys;2022-11-21