1. Policy Learning with Constraints in Model-free Reinforcement Learning: A Survey
2. Trust region policy optimization;schulman;32nd Int Conf Mach Learn ICML 2015,2015
3. Asynchronous methods for deep reinforcement learning;mnih;ICML 2016 33rd International Conf Machine Learning,2016
4. Next generation mobile networks radio access performance evaluation methodology;NGMN Tech Work Gr Steer Comm,2008
5. Optimizing neural networks with Kronecker-factored approximate curvature;martens;32nd Int Conf Mach Learn ICML 2015,2015