Underestimation estimators to Q-learning-Reference-Cited by-同舟云学术

Underestimation estimators to Q-learning

Published:2022-08 Issue: Volume:607 Page:173-185
ISSN:0020-0255
Container-title:Information Sciences
language:en
Short-container-title:Information Sciences

Author:

Abliz Patigül,Ying Shi

Funder

National Natural Science Foundation of China

Publisher

Elsevier BV

Subject

Artificial Intelligence,Information Systems and Management,Computer Science Applications,Theoretical Computer Science,Control and Systems Engineering,Software

Reference27 articles.

1. Q-learning;Watkins;Mach. Learn.,1992

2. R.S. Sutton, A.G. Barto, Reinforcement learning: An introduction (2018).

3. Asynchronous stochastic approximation and q-learning;Tsitsiklis;Mach. Learn.,1994

4. H. Van Hasselt, Estimating the maximum expected value: an analysis of (nested) cross validation and the maximum sample average, arXiv preprint arXiv:1302.7175.

5. S. Thrun, A. Schwartz, Issues in using function approximation for reinforcement learning, in: Proceedings of the Fourth Connectionist Models Summer School, Hillsdale, NJ, 1993, pp. 255–263.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A unified framework to control estimation error in reinforcement learning;Neural Networks;2024-10

2. Intelligent Interference Waveform Design for Radar Detection based on Cross-Correlation Value Function;IEEE Transactions on Aerospace and Electronic Systems;2024

3. Off‐policy correction algorithm for double Q network based on deep reinforcement learning;IET Cyber-Systems and Robotics;2023-12

4. Traffic signal optimization control method based on adaptive weighted averaged double deep Q network;Applied Intelligence;2023-01-27