Temporal difference learning for heuristic search and game playing-Reference-Cited by-同舟云学术

Temporal difference learning for heuristic search and game playing

Published:2000-01 Issue:1 Volume:122 Page:3-21
ISSN:0020-0255
Container-title:Information Sciences
language:en
Short-container-title:Information Sciences

Author:

Beal D.F.,Smith M.C.

Publisher

Elsevier BV

Subject

Artificial Intelligence,Information Systems and Management,Computer Science Applications,Theoretical Computer Science,Control and Systems Engineering,Software

Reference15 articles.

1. Learning to predict by the methods of temporal differences;Sutton;Machine Learning,1988

2. D.F. Beal, M.C. Smith, Temporal coherence and prediction decay in temporal difference learning, Technical Report no. 756, Department of Computer Science, Queen Mary and Westfield College, University of London, 1998

3. Machine learning in computer chess: the next generation;Fürnkranz;International Computer Chess Association Journal,1996

4. Evaluation tuning for computer chess: linear discriminant methods;Anantharaman;International Computer Chess Association Journal,1997

5. Practical issues in temporal difference learning;Tesauro;Machine Learning,1992

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Variants of Bellman equation on reinforcement learning problems;2nd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2022);2022-11-11

2. Two-Agent Self-Play;Deep Reinforcement Learning;2022

3. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play;Science;2018-12-07

4. Online Adaptable Learning Rates for the Game Connect-4;IEEE Transactions on Computational Intelligence and AI in Games;2016-03

5. On Learning From Game Annotations;IEEE Transactions on Computational Intelligence and AI in Games;2015-09