1. Online learning for linearly parametrized control problems;Abbasi-Yadkori,2013
2. Model-free linear quadratic control via reduction to expert prediction;Abbasi-Yadkori,2019
3. Improved algorithms for linear stochastic bandits;Abbasi-Yadkori;Advances in Neural Information Processing Systems,2011
4. Online least squares estimation with self-normalized processes: An application to bandit problems;Abbasi-Yadkori,2011
5. Abbasi-Yadkori, Yasin, & Szepesvári, Csaba (2011). Regret bounds for the adaptive control of linear quadratic systems. In Proceedings of the 24th annual conference on learning theory (pp. 1–26).