1. Abbasi-Yadkori Y, Szepesvári C (2011) Regret bounds for the adaptive control of linear quadratic systems. Kakade S, von Luxburg U, eds. 24th Annual Conf. Learn. Theory (COLT), Vol. 19, 1–26.
2. Abbasi-Yadkori Y, Pál D, Szepesvári C (2011) Improved algorithms for linear stochastic bandits. Shawe-Taylor J, Zemel RS, Bartlett P, Pereira FCN, Weinberger KQ, eds. Advances in Neural Information Processing Systems 24, 2312–2320.
3. Dynamic Pricing for Nonperishable Products with Demand Learning