Bi-Level Actor-Critic for Multi-Agent Coordination-Reference-Cited by-同舟云学术

Bi-Level Actor-Critic for Multi-Agent Coordination

Published:2020-04-03 Issue:05 Volume:34 Page:7325-7332
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Zhang Haifeng,Chen Weizhe,Huang Zeren,Li Minne,Yang Yaodong,Zhang Weinan,Wang Jun

Abstract

Coordination is one of the essential problems in multi-agent systems. Typically multi-agent reinforcement learning (MARL) methods treat agents equally and the goal is to solve the Markov game to an arbitrary Nash equilibrium (NE) when multiple equilibra exist, thus lacking a solution for NE selection. In this paper, we treat agents unequally and consider Stackelberg equilibrium as a potentially better convergence point than Nash equilibrium in terms of Pareto superiority, especially in cooperative environments. Under Markov games, we formally define the bi-level reinforcement learning problem in finding Stackelberg equilibrium. We propose a novel bi-level actor-critic learning method that allows agents to have different knowledge base (thus intelligent), while their actions still can be executed simultaneously and distributedly. The convergence proof is given, while the resulting learning algorithm is tested against the state of the arts. We found that the proposed bi-level actor-critic algorithm successfully converged to the Stackelberg equilibria in matrix games and find a asymmetric solution in a highway merge environment.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. MADRL-Based DSO-Customer Coordinated Bi-Level Volt/VAR Optimization Method for Power Distribution Networks;IEEE Transactions on Sustainable Energy;2024-07

2. Stackelberg Game-Theoretic Trajectory Guidance for Multi-Robot Systems with Koopman Operator;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

3. Multi-agent Reinforcement Learning for Safe Driving in On-ramp Merging of Autonomous Vehicles;2024 14th International Conference on Cloud Computing, Data Science & Engineering (Confluence);2024-01-18

4. Reinforcement learning algorithms;Decision-Making Models;2024

5. Decoupled Rationalization with Asymmetric Learning Rates: A Flexible Lipschitz Restraint;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04