Multi-Task Deep Reinforcement Learning with PopArt-Reference-Cited by-同舟云学术

Multi-Task Deep Reinforcement Learning with PopArt

Published:2019-07-17 Issue: Volume:33 Page:3796-3803
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Hessel Matteo,Soyer Hubert,Espeholt Lasse,Czarnecki Wojciech,Schmitt Simon,Van Hasselt Hado

Abstract

The reinforcement learning (RL) community has made great strides in designing algorithms capable of exceeding human performance on specific tasks. These algorithms are mostly trained one task at the time, each new task requiring to train a brand new agent instance. This means the learning algorithm is general, but each solution is not; each agent can only solve the one task it was trained on. In this work, we study the problem of learning to master not one but multiple sequentialdecision tasks at once. A general issue in multi-task learning is that a balance must be found between the needs of multiple tasks competing for the limited resources of a single learning system. Many learning algorithms can get distracted by certain tasks in the set of tasks to solve. Such tasks appear more salient to the learning process, for instance because of the density or magnitude of the in-task rewards. This causes the algorithm to focus on those salient tasks at the expense of generality. We propose to automatically adapt the contribution of each task to the agent’s updates, so that all tasks have a similar impact on the learning dynamics. This resulted in state of the art performance on learning to play all games in a set of 57 diverse Atari games. Excitingly, our method learned a single trained policy - with a single set of weights - that exceeds median human performance. To our knowledge, this was the first time a single agent surpassed human-level performance on this multi-task domain. The same approach also demonstrated state of the art performance on a set of 30 tasks in the 3D reinforcement learning platform DeepMind Lab.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 75 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-Agent Reinforcement Learning Based Uplink OFDMA for IEEE 802.11ax Networks;IEEE Transactions on Wireless Communications;2024-08

2. LDM: A Generic Data-Driven Large Distribution Network Operation Model;IEEE Transactions on Smart Grid;2024-07

3. MORPH: Design Co-optimization with Reinforcement Learning via a Differentiable Hardware Model Proxy;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

4. Robot Skill Generalization: Feature-Selected Adaptation Transfer for Peg-in-Hole Assembly;IEEE Transactions on Industrial Electronics;2024-03

5. Automatic data augmentation for medical image segmentation using Adaptive Sequence-length based Deep Reinforcement Learning;Computers in Biology and Medicine;2024-02