TD-Gammon: A Self-Teaching Backgammon Program

Published: 1995 Page: 267-285
Container-title: Applications of Neural Networks

Author:

Gerald Tesauro

Publisher

Springer US

Link

http://link.springer.com/content/pdf/10.1007/978-1-4757-2379-3_11

References: 24 articles.

1. H. Berliner, “Computer backgammon.” Scientific American 243:1, 64–72 (1980).

2. D. P. Bertsekas, Dynamic Programming: Deterministic and Stochastic Models. Englewood Cliffs NJ: Prentice Hall (1987).

3. J. Christensen and R. Korf, “A unified theory of heuristic evaluation functions and its application to learning.” Proc. of AAAI-86, 148–152 (1986).

4. P. Dayan, “The convergence of TD(λ) for general λ.” Machine Learning 8, 341–362 (1992).

5. P. W. Frey, “Algorithmic strategies for improving the performance of game playing programs.” In: D. Farmer et al. (Eds.), Evolution, Games and Learning. Amsterdam: North Holland (1986).

Cited by 22 articles.

1. Su Altı Otonom Araçlarda Derin Q-Ağları Algoritması Kullanılarak ROS Tabanlı Yol Planlama [ROS-Based Path Planning Using the Deep Q-Network Algorithm in Autonomous Underwater Vehicles];Gazi Üniversitesi Fen Bilimleri Dergisi Part C: Tasarım ve Teknoloji;2024-06-29

2. M0RV Model: Advancing the MuZero Algorithm Through Strategic Data Optimization Reuse and Value Function Refinement;IEEE Access;2024

3. A social path to human-like artificial intelligence;Nature Machine Intelligence;2023-11-17

4. Improving the Performance of Deep Q-learning in Games Pong and Ms. Pacman;Highlights in Science, Engineering and Technology;2023-04-01

5. Reinforcement learning architecture for cyber–physical–social AI: state-of-the-art and perspectives;Artificial Intelligence Review;2023-03-22