Bayesian controller fusion: Leveraging control priors in deep reinforcement learning for robotics

Published: 2023-03 Issue: 3 Volume: 42 Pages: 123-146
ISSN: 0278-3649
Container-title: The International Journal of Robotics Research
Language: en

Author:

Rana Krishan¹, Dasagi Vibhavari¹, Haviland Jesse¹, Talbot Ben¹, Milford Michael¹, Sünderhauf Niko¹

Affiliation:

1. Queensland University of Technology (QUT) Centre for Robotics, Brisbane, Australia

Abstract

We present Bayesian Controller Fusion (BCF): a hybrid control strategy that combines the strengths of traditional hand-crafted controllers and model-free deep reinforcement learning (RL). BCF thrives in the robotics domain, where reliable but suboptimal control priors exist for many tasks, but RL from scratch remains unsafe and data-inefficient. By fusing uncertainty-aware distributional outputs from each system, BCF arbitrates control between them, exploiting their respective strengths. We study BCF on two real-world robotics tasks: navigation in a vast, long-horizon environment, and a complex reaching task that involves manipulability maximisation. For both domains, simple hand-crafted controllers exist that can solve the task in a risk-averse manner but do not necessarily reach the optimal solution, given limitations in analytical modelling, controller miscalibration and task variation. Because exploration is naturally guided by the prior in the early stages of training, BCF accelerates learning, and it improves substantially beyond the performance of the control prior as the policy gains more experience. More importantly, given the risk-averse nature of the control prior, BCF ensures safe exploration and deployment: the control prior naturally dominates the action distribution in states unknown to the policy. We additionally show BCF's applicability to the zero-shot sim-to-real setting and its ability to deal with out-of-distribution states in the real world. BCF is a promising approach towards combining the complementary strengths of deep RL and traditional robotic control, surpassing what either can achieve independently. The code and supplementary video material are publicly available at https://krishanrana.github.io/bcf.
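
The fusion step described in the abstract can be illustrated with a minimal sketch. It assumes that both the learned policy and the control prior output a Gaussian over actions and that the two are combined as a precision-weighted product; the abstract does not spell out the exact formulation, so the function name `fuse_gaussians` and all numbers below are hypothetical and only meant to show why the more certain system dominates.

```python
import numpy as np


def fuse_gaussians(mu_policy, sigma_policy, mu_prior, sigma_prior):
    """Precision-weighted product of two Gaussian action distributions.

    Assumption for illustration: each system reports a mean and a standard
    deviation per action dimension, and the two are treated as independent.
    """
    var_policy = np.square(sigma_policy)
    var_prior = np.square(sigma_prior)
    # The fused mean weights each mean by the OTHER system's variance,
    # so the lower-variance (more certain) system pulls the result towards it.
    fused_mu = (mu_policy * var_prior + mu_prior * var_policy) / (var_policy + var_prior)
    fused_var = (var_policy * var_prior) / (var_policy + var_prior)
    return fused_mu, np.sqrt(fused_var)


# Hypothetical unfamiliar state: the policy is very uncertain (sigma = 2.0),
# so the fused action stays close to the control prior's action (0.0).
mu_unknown, sigma_unknown = fuse_gaussians(
    np.array([1.0]), np.array([2.0]), np.array([0.0]), np.array([0.2])
)

# Hypothetical familiar state: the policy is confident (sigma = 0.1),
# so the fused action moves close to the policy's action (1.0).
mu_known, sigma_known = fuse_gaussians(
    np.array([1.0]), np.array([0.1]), np.array([0.0]), np.array([0.5])
)

print(mu_unknown, sigma_unknown)
print(mu_known, sigma_known)
```

Under these assumptions the arbitration needs no explicit switching rule: whichever system reports the lower uncertainty in a given state contributes more to the fused action, which matches the behaviour the abstract attributes to BCF in unknown states.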

Funder

Queensland University of Technology (QUT) Centre for Robotics

Australian Research Council Centre of Excellence for Robotic Vision

Publisher

SAGE Publications

Subject

Applied Mathematics, Artificial Intelligence, Electrical and Electronic Engineering, Mechanical Engineering, Modeling and Simulation, Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/02783649231167210

Cited by 4 articles.

1. Strangeness-driven exploration in multi-agent reinforcement learning;Neural Networks;2024-04

2. Millimeter-Level Pick and Peg-in-Hole Task Achieved by Aerial Manipulator;IEEE Transactions on Robotics;2024

3. Physics-Model-Regulated Deep Reinforcement Learning Towards Safety & Stability Guarantees;2023 62nd IEEE Conference on Decision and Control (CDC);2023-12-13

4. Skill Fusion in Hybrid Robotic Framework for Visual Object Goal Navigation;Robotics;2023-07-16