Decentralised Learning in Systems With Many, Many Strategic Agents

Published: 2018-04-26 Issue: 1 Volume: 32
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Mguni David, Jennings Joel, Munoz de Cote Enrique

Abstract

Although multi-agent reinforcement learning can tackle systems of strategically interacting entities, it currently fails in scalability and lacks rigorous convergence guarantees. Crucially, learning in multi-agent systems can become intractable due to the explosion in the size of the state-action space as the number of agents increases. In this paper, we propose a method for computing closed-loop optimal policies in multi-agent systems that scales independently of the number of agents. This allows us to show, for the first time, successful convergence to optimal behaviour in systems with an unbounded number of interacting adaptive learners. Studying the asymptotic regime of N-player stochastic games, we devise a learning protocol that is guaranteed to converge to equilibrium policies even when the number of agents is extremely large. Our method is model-free and completely decentralised so that each agent need only observe its local state information and its realised rewards. We validate these theoretical results by showing convergence to Nash-equilibrium policies in applications from economics and control theory with thousands of strategically interacting agents.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Cited by 9 articles.

1. Implementing a Hierarchical Deep Learning Approach for Simulating Multilevel Auction Data;Computational Economics;2024-05-18

2. Scalable Learning for Spatiotemporal Mean Field Games Using Physics-Informed Neural Operator;Mathematics;2024-03-08

3. Mean-Field Learning for Day-to-Day Departure Time Choice with Mode Switching;2023 62nd IEEE Conference on Decision and Control (CDC);2023-12-13

4. Decision making in open agent systems;AI Magazine;2023-10-09

5. LJIR: Learning Joint-Action Intrinsic Reward in cooperative multi-agent reinforcement learning;Neural Networks;2023-10