1. Deterministic policy gradient algorithms;silver;Proc 31st Int Conf Mach Learn,2014
2. Trust region policy optimization;schulman;arXiv:1502.05477 [cs],2015
3. Federated reinforcement learning: techniques, applications, and open challenges
4. Fault-tolerant federated reinforcement learning with theoretical guarantee;fan;Proc Adv Neural Inf Process Syst,2021
5. Policy gradient methods for reinforcement learning with function approximation;sutton;Proc Adv Neural Inf Process Syst,2000