Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems-Reference-Cited by-同舟云学术

Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems

Published:2021-08-31 Issue:9 Volume:23 Page:1133
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Gu Shanzhi,Geng Mingyang^ORCID,Lan Long

Abstract

The aim of multi-agent reinforcement learning systems is to provide interacting agents with the ability to collaboratively learn and adapt to the behavior of other agents. Typically, an agent receives its private observations providing a partial view of the true state of the environment. However, in realistic settings, the harsh environment might cause one or more agents to show arbitrarily faulty or malicious behavior, which may suffice to allow the current coordination mechanisms fail. In this paper, we study a practical scenario of multi-agent reinforcement learning systems considering the security issues in the presence of agents with arbitrarily faulty or malicious behavior. The previous state-of-the-art work that coped with extremely noisy environments was designed on the basis that the noise intensity in the environment was known in advance. However, when the noise intensity changes, the existing method has to adjust the configuration of the model to learn in new environments, which limits the practical applications. To overcome these difficulties, we present an Attention-based Fault-Tolerant (FT-Attn) model, which can select not only correct, but also relevant information for each agent at every time step in noisy environments. The multihead attention mechanism enables the agents to learn effective communication policies through experience concurrent with the action policies. Empirical results showed that FT-Attn beats previous state-of-the-art methods in some extremely noisy environments in both cooperative and competitive scenarios, much closer to the upper-bound performance. Furthermore, FT-Attn maintains a more general fault tolerance ability and does not rely on the prior knowledge about the noise intensity of the environment.

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/23/9/1133/pdf

Reference34 articles.

1. Learning to cooperate in decentralized multirobot exploration of dynamic environments;Geng,2018

2. A Multiagent Approach to Autonomous Intersection Management

3. Learning to Cooperate via an Attention-Based Communication Neural Network in Decentralized Multi-Robot Exploration

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An overview: Attention mechanisms in multi-agent reinforcement learning;Neurocomputing;2024-09

2. Leveraging heterogeneous networks to analyze energy storage systems in power systems and renewable energy research: a scientometric study;Frontiers in Energy Research;2024-07-11

3. Reinforcement learning for multi-agent with asynchronous missing information fusion method;International Journal of Machine Learning and Cybernetics;2024-06-07

4. Fault-Tolerant Control for Multi-UAV Exploration System via Reinforcement Learning Algorithm;Aerospace;2024-05-08

5. State Super Sampling Soft Actor–Critic Algorithm for Multi-AUV Hunting in 3D Underwater Environment;Journal of Marine Science and Engineering;2023-06-21