Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning

Published: 2023-06-26 Issue: 8 Volume: 37 Page: 10157-10165
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Wang Mingyang, Bing Zhenshan, Yao Xiangtong, Wang Shuai, Kai Huang, Su Hang, Yang Chenguang, Knoll Alois

Abstract

Meta-reinforcement learning enables artificial agents to learn from related training tasks and adapt to new tasks efficiently with minimal interaction data. However, most existing research is still limited to narrow task distributions that are parametric and stationary, and does not consider out-of-distribution tasks during evaluation, which restricts its application. In this paper, we propose MoSS, a context-based Meta-reinforcement learning algorithm based on Self-Supervised task representation learning, to address this challenge. We extend meta-RL to broad non-parametric task distributions, which have never been explored before, and also achieve state-of-the-art results on non-stationary and out-of-distribution tasks. Specifically, MoSS consists of a task inference module and a policy module. We use a Gaussian mixture model for task representation to capture both parametric and non-parametric task variations. Additionally, our online adaptation strategy enables the agent to react as soon as it observes a task change, making it applicable to non-stationary tasks. MoSS also generalizes robustly to out-of-distribution tasks, which it owes to its reliable and robust task representation. The policy is built on top of an off-policy RL algorithm, and the entire network is trained completely off-policy to ensure high sample efficiency. On the MuJoCo and Meta-World benchmarks, MoSS outperforms prior work in terms of asymptotic performance, sample efficiency (3-50x faster), adaptation efficiency, and generalization robustness on broad and diverse task distributions.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 5 articles.

1. Multi-Access Edge Computing for Real-Time Applications With Sporadic DAG Tasks – A Graphical Game Approach; IEEE Transactions on Mobile Computing; 2024-10

2. Optimizing Dynamic Balance in a Rat Robot via the Lateral Flexion of a Soft Actuated Spine; 2024 IEEE International Conference on Robotics and Automation (ICRA); 2024-05-13

3. Contact Energy Based Hindsight Experience Prioritization; 2024 IEEE International Conference on Robotics and Automation (ICRA); 2024-05-13

4. Learning from Symmetry: Meta-Reinforcement Learning with Symmetrical Behaviors and Language Instructions; 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); 2023-10-01

5. Meta-Reinforcement Learning via Language Instructions; 2023 IEEE International Conference on Robotics and Automation (ICRA); 2023-05-29