1. Debabrota Basu , Qian Lin , Weidong Chen , Hoang Tam Vo , Zihong Yuan, Pierre Senellart, and Stéphane Bressan. 2015 . Cost-model oblivious database tuning with reinforcement learning. In Database and Expert Systems Applications . 253--268. Debabrota Basu, Qian Lin, Weidong Chen, Hoang Tam Vo, Zihong Yuan, Pierre Senellart, and Stéphane Bressan. 2015. Cost-model oblivious database tuning with reinforcement learning. In Database and Expert Systems Applications. 253--268.
2. Ranked batch-mode active learning;Cardoso Thiago NC;Information Sciences,2017
3. Jonathan Ho and Stefano Ermon . 2016. Generative adversarial imitation learning. Advances in neural information processing systems 29 ( 2016 ). Jonathan Ho and Stefano Ermon. 2016. Generative adversarial imitation learning. Advances in neural information processing systems 29 (2016).
4. How to train your robot with deep reinforcement learning: lessons we have learned;Ibarz Julian;The International Journal of Robotics Research,2021
5. Emilie Kaufmann Olivier Cappé and Aurélien Garivier. 2012. On Bayesian upper confidence bounds for bandit problems. In Artificial intelligence and statistics. PMLR 592--600. Emilie Kaufmann Olivier Cappé and Aurélien Garivier. 2012. On Bayesian upper confidence bounds for bandit problems. In Artificial intelligence and statistics. PMLR 592--600.