1. Sutton RS. Temporal Credit Assignment in Reinforcement Learning. PhD thesis. University of Massachusetts Amherst; 1984.
2. Reinforcement Learning: A Survey
3. Mnih V, et al. Playing Atari with deep reinforcement learning. CoRR; 2013.
4. Mnih V, Badia AP, Mirza M, et al. Asynchronous methods for deep reinforcement learning. ICML'16; 2016:1928-1937. JMLR.org.
5. Schulman J, et al. Proximal policy optimization algorithms. CoRR; 2017.