1. Shipra Agrawal and Navin Goyal. 2013. Thompson sampling for contextual bandits with linear payoffs. In International Conference on Machine Learning. PMLR, 127--135.
2. Sanjeev Arora, Simon Du, Sham Kakade, Yuping Luo, and Nikunj Saunshi. 2020. Provable representation learning for imitation learning via bi-level optimization. In International Conference on Machine Learning. PMLR, 367--376.
3. Cold-Start Item and User Recommendation with Decoupled Completion and Transduction
4. Carousel Personalization in Music Streaming Apps with Contextual Bandits
5. Yoshua Bengio, Aaron Courville, and Pascal Vincent. 2013. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, 8 (2013), 1798--1828.