Funder
Council of Scientific and Industrial Research, India
Subject
Electrical and Electronic Engineering,Mechanical Engineering,General Computer Science,Control and Systems Engineering
Reference33 articles.
1. Least squares policy evaluation algorithms with linear function approximation;Nedić;Discrete Event Dyn. Syst.,2003
2. Improved temporal difference methods with linear function approximation;Bertsekas,2004
3. Convergence results for some temporal difference methods based on least squares;Yu;IEEE Trans. Automat. Control,2009
4. A finite time analysis of temporal difference learning with linear function approximation;Bhandari;Oper. Res.,2021
5. Finite-sample analysis of contractive stochastic approximation using smooth convex envelopes;Chen,2020