1. Junyoung Chung , Caglar Gulcehre , KyungHyun Cho , and Yoshua Bengio . 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 ( 2014 ). Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014).
2. Clinical data based optimal STI strategies for HIV: a reinforcement learning approach
3. Probabilistic policy reuse in a reinforcement learning agent
4. Scott Fujimoto , Edoardo Conti , Mohammad Ghavamzadeh , and Joelle Pineau . 2019. Benchmarking batch deep reinforcement learning algorithms. arXiv preprint arXiv:1910.01708 ( 2019 ). Scott Fujimoto, Edoardo Conti, Mohammad Ghavamzadeh, and Joelle Pineau. 2019. Benchmarking batch deep reinforcement learning algorithms. arXiv preprint arXiv:1910.01708 (2019).