1. Yasin Abbasi-Yadkori and Csaba Szepesvári. 2011. Regret bounds for the adaptive control of linear quadratic systems. In Proceedings of the 24th Annual Conference on Learning Theory. JMLR Workshop and Conference Proceedings, 1--26.
2. Naman Agarwal, Brian Bullins, Elad Hazan, Sham Kakade, and Karan Singh. 2019. Online control with adversarial disturbances. In International Conference on Machine Learning. PMLR, 111--119.
3. The size of the membership-set in a probabilistic framework
4. Distributed Q-Learning for Dynamically Decoupled Systems
5. Siavash Alemzadeh, Shahriar Talebi, and Mehran Mesbahi. 2021. D3PI: Data-Driven Distributed Policy Iteration for Homogeneous Interconnected Systems. arXiv preprint arXiv:2103.11572 (2021).