Efficient policy detecting and reusing for non-stationarity in Markov games-Reference-Cited by-同舟云学术

Efficient policy detecting and reusing for non-stationarity in Markov games

Published:2020-10-26 Issue:1 Volume:35 Page:
ISSN:1387-2532
Container-title:Autonomous Agents and Multi-Agent Systems
language:en
Short-container-title:Auton Agent Multi-Agent Syst

Author:

Zheng Yan^ORCID,Hao Jianye,Zhang Zongzhang,Meng Zhaopeng,Yang Tianpei,Li Yanran,Fan Changjie

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10458-020-09480-9.pdf

Reference41 articles.

1. Albrecht, S. V., & Stone, P. (2018). Autonomous agents modelling other agents: A comprehensive survey and open problems. Artificial Intelligence, 258, 66–95.

2. Banerjee, T., Liu, M., & How, J. P. (2017). Quickest change detection approach to optimal control in Markov decision processes with model changes. In 2017 American control conference (ACC) (pp. 399–405).

3. Brafman, R. I., & Tennenholtz, M. (2003). R-max—A general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3, 213–231.

4. Chalkiadakis, G., & Boutilier, C. (2003). Coordination in multiagent reinforcement learning: A Bayesian approach. In Proceedings of the 2nd international conference on autonomous agents and multiagent systems (AAMAS) (pp. 709–716).

5. Crandall, J. W. (2012). Just add pepper: Extending learning algorithms for repeated matrix games to repeated Markov games. In Proceedings of the 11th international conference on autonomous agents and multiagent systems (AAMAS) (pp. 399–406).

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. How to Prevent the Continuous Damage of Noises to Model Training?;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06

2. Efficient Bayesian Policy Reuse With a Scalable Observation Model in Deep Reinforcement Learning;IEEE Transactions on Neural Networks and Learning Systems;2023

3. Opponent Exploitation Based on Bayesian Strategy Inference and Policy Tracking;IEEE Transactions on Games;2023

4. OM-TCN: A dynamic and agile opponent modeling approach for competitive games;Information Sciences;2022-11

5. Bayesian Opponent Exploitation by Inferring the Opponent’s Policy Selection Pattern;2022 IEEE Conference on Games (CoG);2022-08-21