Author:
Matignon Laetitia, Laurent Guillaume J., Le Fort-Piat Nadine
Abstract
In the framework of fully cooperative multi-agent systems, independent (non-communicative) agents that learn by reinforcement must overcome several difficulties in order to coordinate. This paper identifies several challenges responsible for the non-coordination of independent agents: Pareto-selection, non-stationarity, stochasticity, alter-exploration and shadowed equilibria. A selection of multi-agent domains is classified according to these challenges: matrix games, Boutilier's coordination game, predator pursuit domains and a special multi-state game. Moreover, the performance of a range of algorithms for independent reinforcement learners is evaluated empirically. These algorithms are Q-learning variants: decentralized Q-learning, distributed Q-learning, hysteretic Q-learning, recursive frequency maximum Q-value and win-or-learn fast policy hill climbing. An overview of the learning algorithms' strengths and weaknesses against each challenge concludes the paper and can serve as a basis for choosing the appropriate algorithm for a new domain. Furthermore, the distilled challenges may assist in the design of new learning algorithms that overcome these problems and achieve higher performance in multi-agent applications.
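As a rough illustration of one of the surveyed Q-learning variants, the sketch below shows the hysteretic Q-learning update for a single independent learner in a tabular setting. The function name, the environment interface and the parameter names (alpha, beta, gamma) are illustrative assumptions, not taken from the paper; the core idea is the use of two learning rates, a larger one for positive temporal-difference errors and a smaller one for negative errors, so that an agent is less penalized by its teammates' exploratory actions.

```python
import numpy as np

def hysteretic_q_update(Q, state, action, reward, next_state,
                        alpha=0.1, beta=0.01, gamma=0.9):
    """One hysteretic Q-learning update (minimal sketch, illustrative names).

    Increases to Q[state, action] are applied with the larger rate alpha,
    decreases with the smaller rate beta (beta < alpha), making the agent
    optimistic with respect to occasional low rewards caused by other agents.
    """
    td_error = reward + gamma * np.max(Q[next_state]) - Q[state, action]
    if td_error >= 0:
        Q[state, action] += alpha * td_error
    else:
        Q[state, action] += beta * td_error
    return Q

# Illustrative usage on a toy table with 5 states and 2 actions.
Q = np.zeros((5, 2))
Q = hysteretic_q_update(Q, state=0, action=1, reward=1.0, next_state=2)
```

Setting beta equal to alpha recovers standard decentralized Q-learning, while beta = 0 corresponds to a fully optimistic update in the spirit of distributed Q-learning, which is one way to read the spectrum of variants compared in the paper.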
Publisher
Cambridge University Press (CUP)
Subject
Artificial Intelligence, Software
Cited by
227 articles.