1. Jalaj Bhandari , Daniel Russo , and Raghav Singal . 2021 . A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation . Operations Research , Vol. 69 , 3 (01 May 2021), 950--973. https://doi.org/10.1287/opre.2020.2024 10.1287/opre.2020.2024 Jalaj Bhandari, Daniel Russo, and Raghav Singal. 2021. A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation. Operations Research, Vol. 69, 3 (01 May 2021), 950--973. https://doi.org/10.1287/opre.2020.2024
2. Dongyann (Lucy) Huo Yudong Chen and Qiaomin Xie. 2022. Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes. https://doi.org/10.48550/ARXIV.2210.00953 10.48550/ARXIV.2210.00953
3. Dongyann (Lucy) Huo Yudong Chen and Qiaomin Xie. 2022. Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes. https://doi.org/10.48550/ARXIV.2210.00953
4. Chandrashekar Lakshminarayanan and Csaba Szepesvári . 2018 . Linear Stochastic Approximation: How Far Does Constant Step-Size and Iterate Averaging Go? . In Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics. PMLR, 1347--1355 . Chandrashekar Lakshminarayanan and Csaba Szepesvári. 2018. Linear Stochastic Approximation: How Far Does Constant Step-Size and Iterate Averaging Go?. In Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics. PMLR, 1347--1355.
5. Wenlong Mou , Chris Junchi Li , Martin J. Wainwright , Peter L. Bartlett , and Michael I. Jordan . 2020. On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration . In Proceedings of Thirty Third Conference on Learning Theory. PMLR, 2947--2997 . Wenlong Mou, Chris Junchi Li, Martin J. Wainwright, Peter L. Bartlett, and Michael I. Jordan. 2020. On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration. In Proceedings of Thirty Third Conference on Learning Theory. PMLR, 2947--2997.