1. Multi-agent actor-critic for mixed cooperative-competitive environments;lowe;Advances in neural information processing systems,2017
2. Addressing function approximation error in actor-critic methods;fujimoto;Int Conference on Machine Learning,2018
3. Reducing overestimation bias in multi-agent domains using double centralized critics;ackermann;arXiv preprint arXiv 1910 01465,2019
4. Openai’s maddpg algorithm;nguyen,2020
5. Twin delayed ddpg,0