1. Identifying New Podcasts with High General Appeal Using a Pure Exploration Infinitely-Armed Bandit Strategy
2. Olivier Chapelle and Lihong Li. 2011. An empirical evaluation of Thompson sampling. In Advances in Neural Information Processing Systems 24. Granada, Spain.
3. Bias and Debias in Recommender System: A Survey and Future Directions
4. Melanie Coggan. 2004. Exploration and exploitation in reinforcement learning. Research supervised by Prof. Doina Precup, CRA-W DMP Project at McGill University (2004).
5. Expediting Exploration by Attribute-to-Feature Mapping for Cold-Start Recommendations