Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms-Reference-Cited by-同舟云学术

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Published:2021 Issue: Volume: Page:321-384
ISSN:2198-4182
Container-title:Handbook of Reinforcement Learning and Control
language:
Short-container-title:

Author:

Zhang Kaiqing,Yang Zhuoran,Başar Tamer

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-60990-0_12

Reference384 articles.

1. Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484–489 (2016)

2. Silver, D., Schrittwieser, J., Simonyan, K., Antonoglou, I., Huang, A., Guez, A., Hubert, T., Baker, L., Lai, M., Bolton, A., et al.: Mastering the game of Go without human knowledge. Nature 550(7676), 354 (2017)

3. OpenAI: Openai five. https://blog.openai.com/openai-five/ (2018)

4. Vinyals, O., Babuschkin, I., Chung, J., Mathieu, M., Jaderberg, M., Czarnecki, W.M., Dudzik, A., Huang, A., Georgiev, P., Powell, R., Ewalds, T., Horgan, D., Kroiss, M., Danihelka, I., Agapiou, J., Oh, J., Dalibard, V., Choi, D., Sifre, L., Sulsky, Y., Vezhnevets, S., Molloy, J., Cai, T., Budden, D., Paine, T., Gulcehre, C., Wang, Z., Pfaff, T., Pohlen, T., Wu, Y., Yogatama, D., Cohen, J., McKinney, K., Smith, O., Schaul, T., Lillicrap, T., Apps, C., Kavukcuoglu, K., Hassabis, D., Silver, D.: AlphaStar: mastering the real-time strategy game starcraft II. https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game-starcraft-ii/ (2019)

5. Kober, J., Bagnell, J.A., Peters, J.: Reinforcement learning in robotics: a survey. Int. J. Robot. Res. 32(11), 1238–1274 (2013)

Cited by 336 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Balancing individual and collective strategies: A new approach in metaheuristic optimization;Mathematics and Computers in Simulation;2025-01

2. Age of information minimization in UAV-assisted data harvesting networks by multi-agent deep reinforcement curriculum learning;Expert Systems with Applications;2024-12

3. Reinforcement learning for electric vehicle charging scheduling: A systematic review;Transportation Research Part E: Logistics and Transportation Review;2024-10

4. iTRPL: An intelligent and trusted RPL protocol based on Multi-Agent Reinforcement Learning;Ad Hoc Networks;2024-10

5. Multi-UAV aided energy-aware transmissions in mmWave communication network: Action-branching QMIX network;Journal of Network and Computer Applications;2024-10