1. Proximal policy optimization algorithms;schulman,2017
2. Multi-Agent Deep Stochastic Policy Gradient for Event Based Dynamic Spectrum Access
3. Scaling multi-agent reinforcement learning with selective parameter sharing;christianos,2021
4. Network Theory: The Basics;owen-smith,2017
5. A Survey of Multi-Task Deep Reinforcement Learning;varghese;Electronics Journal,2020