1. Yasin Abbasi-Yadkori , Dávid Pál , and Csaba Szepesvári . 2011. Improved Algorithms for Linear Stochastic Bandits . In Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011 . Proceedings of a meeting held 12-14 December 2011, Granada, Spain . 2312–2320. Yasin Abbasi-Yadkori, Dávid Pál, and Csaba Szepesvári. 2011. Improved Algorithms for Linear Stochastic Bandits. In Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, Granada, Spain. 2312–2320.
2. Naoki Abe and Philip M. Long . 1999 . Associative Reinforcement Learning using Linear Probabilistic Concepts . In Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999 ), Bled, Slovenia, June 27 - 30 , 1999. Morgan Kaufmann, 3–11. Naoki Abe and Philip M. Long. 1999. Associative Reinforcement Learning using Linear Probabilistic Concepts. In Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27 - 30, 1999. Morgan Kaufmann, 3–11.
3. Charu C Aggarwal 2016. Recommender systems. Vol. 1 . Springer . Charu C Aggarwal 2016. Recommender systems. Vol. 1. Springer.
4. Noga Alon , Nicolò Cesa-Bianchi , Ofer Dekel , and Tomer Koren . 2015 . Online Learning with Feedback Graphs: Beyond Bandits . In Proceedings of The 28th Conference on Learning Theory, COLT 2015, Paris, France, July 3-6, 2015(JMLR Workshop and Conference Proceedings, Vol. 40) . JMLR.org, 23–35. Noga Alon, Nicolò Cesa-Bianchi, Ofer Dekel, and Tomer Koren. 2015. Online Learning with Feedback Graphs: Beyond Bandits. In Proceedings of The 28th Conference on Learning Theory, COLT 2015, Paris, France, July 3-6, 2015(JMLR Workshop and Conference Proceedings, Vol. 40). JMLR.org, 23–35.
5. Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback