Reinforcement Learning for Mean-Field Game-Reference-Cited by-同舟云学术

Reinforcement Learning for Mean-Field Game

Published:2022-02-22 Issue:3 Volume:15 Page:73
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Agarwal Mridul,Aggarwal Vaneet,Ghosh Arnob,Tiwari Nilay

Abstract

Stochastic games provide a framework for interactions among multiple agents and enable a myriad of applications. In these games, agents decide on actions simultaneously. After taking an action, the state of every agent updates to the next state, and each agent receives a reward. However, finding an equilibrium (if exists) in this game is often difficult when the number of agents becomes large. This paper focuses on finding a mean-field equilibrium (MFE) in an action-coupled stochastic game setting in an episodic framework. It is assumed that an agent can approximate the impact of the other agents’ by the empirical distribution of the mean of the actions. All agents know the action distribution and employ lower-myopic best response dynamics to choose the optimal oblivious strategy. This paper proposes a posterior sampling-based approach for reinforcement learning in the mean-field game, where each agent samples a transition probability from the previous transitions. We show that the policy and action distributions converge to the optimal oblivious strategy and the limiting distribution, respectively, which constitute an MFE.

Funder

National Science Foundation

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/15/3/73/pdf

Reference26 articles.

1. Cooperative Multi-Agent Learning: The State of the Art

2. Nash Q-learning for general-sum stochastic games;Hu;J. Mach. Learn. Res.,2003

3. Mean Field Multi-Agent Reinforcement Learning;Yang;arXiv,2018

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing Quadcopter Autonomy: Implementing Advanced Control Strategies and Intelligent Trajectory Planning;Automation;2024-06-14

2. Model-Free Reinforcement Learning for Mean Field Games;IEEE Transactions on Control of Network Systems;2023-12

3. Machine learning driven extended matrix norm method for the solution of large-scale zero-sum matrix games;Journal of Computational Science;2023-04

4. On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC);J MACH LEARN RES;2022