1. Adam: A Method for Stochastic Optimization;kingma;3rd International Conference on Learning Representations,2015
2. Prioritized Experience Replay;schaul;4th International Conference on Learning Representations,2016
3. Automatic Differentiation in PyTorch;paszke;NIPS 2017 Workshop on Autodiff,2017
4. TorchRL Documentation;PyTorch,0
5. Dueling network architectures for deep reinforcement learning;wang;Proceedings of the 33rd International Conference on International Conference on Machine Learning,2016