1. Wasserstein generative adversarial networks;Arjovsky,2017
2. Finite-time analysis of the multiarmed bandit problem;Auer;Machine Learning,2002
3. Badia, A. P., Sprechmann, P., Vitvitskyi, A., Guo, D., Piot, B., Kapturowski, S., Tieleman, O., Arjovsky, M., Pritzel, A., Bolt, A., & Blundell, C. (2020). Never give up: Learning directed exploration strategies. In International conference on learning representations (pp. 1–28).
4. Sample-efficient imitation learning via generative adversarial nets;Blondé,2019
5. Openai gym;Brockman,2016