1. Abbasi-Yadkori, Y., & Szepesvári, C. (2011). Regret bounds for the adaptive control of linear quadratic systems. In Proceedings of the 24th annual conference on learning theory, JMLR workshop and conference proceedings, (pp. 1–26).
2. Real-time optimization by extremum-seeking control;Ariyur,2003
3. Adaptive control;Åström,2013
4. Dynamic programming and optimal control, vol. 1;Bertsekas,1995
5. Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design;Bian;Automatica,2016