Error bounds for constant step-size -learning-Reference-Cited by-同舟云学术

Error bounds for constant step-size -learning

Published:2012-12 Issue:12 Volume:61 Page:1203-1208
ISSN:0167-6911
Container-title:Systems & Control Letters
language:en
Short-container-title:Systems & Control Letters

Author:

Beck C.L.,Srikant R.

Publisher

Elsevier BV

Subject

Electrical and Electronic Engineering,Mechanical Engineering,General Computer Science,Control and Systems Engineering

Reference14 articles.

1. C. Watkins, Learning from delayed rewards, Ph.D. Thesis, University of Cambridge, 1989.

2. Q-learning;Watkins;Machine Learning,1992

3. Asynchronous stochastic approximation and Q-learning;Tsitsiklis;Machine Learning,1994

4. On the convergence of stochastic iterative dynamic programming algorithms;Jaakkola;Neural Computation,1994

5. V.S. Borkar, On the number of samples required for Q-learning, in: 38th Allerton Conf. on Communication, Control and Computing, Monticello, Illinois, 2000.

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Final Iteration Convergence Bound of Q-Learning: Switching System Approach;IEEE Transactions on Automatic Control;2024-07

2. Settling the sample complexity of model-based offline reinforcement learning;The Annals of Statistics;2024-02-01

3. Online Monitoring of Heterogeneous Partially Observable Data Streams Based on Q-Learning;IEEE Transactions on Automation Science and Engineering;2024

4. Compressed Federated Reinforcement Learning with a Generative Model;Lecture Notes in Computer Science;2024

5. Reinforcement learning for humanitarian relief distribution with trucks and UAVs under travel time uncertainty;Transportation Research Part C: Emerging Technologies;2023-12