Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

Published: 2022-06-28 Issue: 8 Volume: 36 Pages: 9217-9224
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Authors:

Zheng Liyuan, Fiez Tanner, Alumbaugh Zane, Chasnov Benjamin, Ratliff Lillian J.

Abstract

The hierarchical interaction between the actor and critic in actor-critic-based reinforcement learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this viewpoint and model the actor and critic interaction as a two-player general-sum game with a leader-follower structure, known as a Stackelberg game. Given this abstraction, we propose a meta-framework for Stackelberg actor-critic algorithms in which the leader player follows the total derivative of its objective instead of the usual individual gradient. From a theoretical standpoint, we develop a policy gradient theorem for the refined update and provide a local convergence guarantee for the Stackelberg actor-critic algorithms to a local Stackelberg equilibrium. From an empirical standpoint, we demonstrate via simple examples that the learning dynamics we study mitigate cycling and accelerate convergence compared to the usual gradient dynamics, given cost structures induced by actor-critic formulations. Finally, extensive experiments on OpenAI Gym environments show that Stackelberg actor-critic algorithms always perform at least as well as, and often significantly outperform, their standard actor-critic counterparts.
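
The difference between the leader's individual gradient and its total derivative can be illustrated on a small example. The following is a minimal sketch, not the paper's algorithm: the scalar quadratic game, the function names, the equal learning rates, and the step count are all assumptions chosen for illustration. The total derivative uses the standard implicit-function-theorem form, df1/dx - (d2f2/dydx)(d2f2/dy2)^-1 df1/dy, which accounts for how the follower's best response shifts as the leader moves.

```python
import math

# Hypothetical toy game (not taken from the paper); both players minimize.
#   leader cost    f1(x, y) = x * y
#   follower cost  f2(x, y) = 0.5 * y**2 - x * y
# The follower's best response is y = x, so the leader's induced cost is x**2
# and the Stackelberg equilibrium is (0, 0).

def individual_grad(x, y):
    """Leader's partial derivative df1/dx, ignoring the follower's reaction."""
    return y

def total_derivative(x, y):
    """Leader's total derivative via the implicit function theorem:
    df1/dx - (d2f2/dydx) * (d2f2/dy2)^-1 * df1/dy."""
    df1_dx, df1_dy = y, x
    d2f2_dydx, d2f2_dy2 = -1.0, 1.0
    return df1_dx - (d2f2_dydx / d2f2_dy2) * df1_dy

def follower_grad(x, y):
    """Follower's partial derivative df2/dy."""
    return y - x

def run(leader_grad, steps=100, lr=0.1, x=1.0, y=1.0):
    """Simultaneous gradient steps; returns the final distance to (0, 0)."""
    for _ in range(steps):
        gx, gy = leader_grad(x, y), follower_grad(x, y)
        x, y = x - lr * gx, y - lr * gy
    return math.hypot(x, y)

individual_dist = run(individual_grad)
stackelberg_dist = run(total_derivative)
print(individual_dist, stackelberg_dist)
```

In this particular toy game, both updates spiral toward the equilibrium, and the total-derivative update ends closer to it after the same number of steps. This says nothing about other cost structures; it only shows how the extra second-order term enters the leader's update.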

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 9 articles.

1. A systematic review and meta-analysis of machine learning, deep learning, and ensemble learning approaches in predicting EV charging behavior; Engineering Applications of Artificial Intelligence; 2024-09

2. Learning in Stochastic Stackelberg Games; 2024 American Control Conference (ACC); 2024-07-10

3. Recent Developments of Game Theory and Reinforcement Learning Approaches: A Systematic Review; IEEE Access; 2024

4. Multi-view reinforcement learning for sequential decision-making with insufficient state information; International Journal of Machine Learning and Cybernetics; 2023-10-24

5. A Weighted Mean Field Reinforcement Learning Algorithm for Large-Scale Multi-Agent Collaboration; Guidance, Navigation and Control; 2023-06