Handling Realistic Noise in Multi-Agent Systems with Self-Supervised Learning and Curiosity

Published:2021-04-01 Issue:2 Volume:12 Page:135-148
ISSN:2449-6499
Container-title:Journal of Artificial Intelligence and Soft Computing Research
language:en

Author:

Szemenyei Márton¹, Reizinger Patrik¹

Affiliation:

1. Department of Control Engineering and Information Technology, Budapest University of Technology and Economics, 1117, Budapest, Magyar Tudosok krt. 2.

Abstract

Most reinforcement learning benchmarks – especially in multi-agent tasks – do not go beyond observations with simple noise; nonetheless, real scenarios induce more elaborate vision pipeline failures: false sightings, misclassifications or occlusion. In this work, we propose a lightweight, 2D environment for robot soccer and autonomous driving that can emulate the above discrepancies. Besides establishing a benchmark for accessible multi-agent reinforcement learning research, our work addresses the challenges the simulator imposes. For handling realistic noise, we use self-supervised learning to enhance scene reconstruction and extend curiosity-driven learning to model longer horizons. Our extensive experiments show that the proposed methods achieve state-of-the-art performance, compared against actor-critic methods, ICM, and PPO.

Publisher

Walter de Gruyter GmbH

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Hardware and Architecture, Modeling and Simulation, Information Systems

Link

https://www.sciendo.com/pdf/10.2478/jaiscr-2022-0009

References (36 articles)

1. Bowen Baker, Ingmar Kanitscheider, Todor M. Markov, Yi Wu, Glenn Powell, Bob McGrew, and Igor Mordatch. Emergent tool use from multi-agent autocurricula. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020.

2. Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, and Wojciech Zaremba. OpenAI Gym. June 2016.

3. Yuri Burda, Harrison Edwards, Deepak Pathak, Amos J. Storkey, Trevor Darrell, and Alexei A. Efros. Large-scale study of curiosity-driven learning. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019.

4. Carl Doersch, Abhinav Gupta, and Alexei A. Efros. Unsupervised visual representation learning by context prediction. May 2015. doi:10.1109/ICCV.2015.167

5. Jeff Donahue, Philipp Krähenbühl, and Trevor Darrell. Adversarial feature learning. May 2016.