Exploring communication protocols and centralized critics in multi-agent deep learning-Reference-Cited by-同舟云学术

Exploring communication protocols and centralized critics in multi-agent deep learning

Published:2020-09-11 Issue:4 Volume:27 Page:333-351
ISSN:1069-2509
Container-title:Integrated Computer-Aided Engineering
language:
Short-container-title:ICA

Author:

Simões David¹,Lau Nuno¹,Reis Luís Paulo²

Affiliation:

1. Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, Aveiro, Portugal

2. Artificial Intelligence and Computer Science Lab, Faculty of Engineering of the University of Porto, Porto, Portugal

Abstract

Tackling multi-agent environments where each agent has a local limited observation of the global state is a non-trivial task that often requires hand-tuned solutions. A team of agents coordinating in such scenarios must handle the complex underlying environment, while each agent only has partial knowledge about the environment. Deep reinforcement learning has been shown to achieve super-human performance in single-agent environments, and has since been adapted to the multi-agent paradigm. This paper proposes A3C3, a multi-agent deep learning algorithm, where agents are evaluated by a centralized referee during the learning phase, but remain independent from each other in actual execution. This referee’s neural network is augmented with a permutation invariance architecture to increase its scalability to large teams. A3C3 also allows agents to learn communication protocols with which agents share relevant information to their team members, allowing them to overcome their limited knowledge, and achieve coordination. A3C3 and its permutation invariant augmentation is evaluated in multiple multi-agent test-beds, which include partially-observable scenarios, swarm environments, and complex 3D soccer simulations.

Publisher

IOS Press

Subject

Artificial Intelligence,Computational Theory and Mathematics,Computer Science Applications,Theoretical Computer Science,Software

Reference70 articles.

1. Multi-object tracking with discriminant correlation filter based deep learning tracker;Yang;Integrated Computer-Aided Engineering,2019

2. Distributed control for 3D metamorphosis;Yim;Autonomous Robots,2001

3. Using multi-agent technology for the distributed management of a cluster of remote sensing satellites;Skobelev;Complex Systems: Fundamentals & Applications,2016

4. Mannion P, Duggan J, Howley E. An experimental review of reinforcement learning algorithms for adaptive traffic signal control. In: Autonomic Road Transport Support Systems. Springer, 2016. pp. 47-66.

5. Multi-agent replicator controller for sustainable vibration control of smart structures;Gutierrez Soto;J Vibroeng,2017

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Distributed Soccer Training Smart Sensors for Multitarget Localization and Tracking;Journal of Sensors;2022-08-05

2. A night pavement crack detection method based on image‐to‐image translation;Computer-Aided Civil and Infrastructure Engineering;2022-05-03

3. A smarthome conversational agent performing implicit demand-response application planning;Integrated Computer-Aided Engineering;2021-12-28

4. One-Dimensional Convolutional Neural Networks Combined with Channel Selection Strategy for Seizure Prediction Using Long-Term Intracranial EEG;International Journal of Neural Systems;2021-10-12

5. Hysteresis Modeling in Iron-Dominated Magnets Based on a Multi-Layered NARX Neural Network Approach;International Journal of Neural Systems;2021-07-22