1. Improved algorithms for linear stochastic bandits;Abbasi-Yadkori,2011
2. On the sample complexity of the linear quadratic regulator;Dean;Found. Comput. Math.,2017
3. Regret bounds for robust adaptive control of the linear quadratic regulator;Dean,2018
4. From self-tuning regulators to reinforcement learning and back again;Matni,2019