1. Policy gradi ent methods for reinforcement learning with function approximation;sutton;Advances in neural information processing systems,1999
2. A new framework for multi-agent reinforcement learning centralized training and exploration with decentralized execution via policy distillation;chen;ArXiv Preprint,2019
3. ATAC
4. Making scheduling” cool”: Temperature-aware workload placement in data centers;moore;USENIX Annual Technical Conference General Track,0
5. Epistemic Conditions for Nash Equilibrium