Funder
MEXT | Japan Society for the Promotion of Science
Human Frontier Science Program
Harvard University
Japan Agency for Medical Research and Development
U.S. Department of Health & Human Services | NIH | National Institute of Neurological Disorders and Stroke
Simons Foundation
U.S. Department of Health & Human Services | NIH | National Institute of Mental Health
Publisher
Springer Science and Business Media LLC
Reference61 articles.
1. Montague, P. R., Dayan, P. & Sejnowski, T. J. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci. 16, 1936–1947 (1996).
2. Schultz, W., Dayan, P. & Montague, P. R. A neural substrate of prediction and reward. Science 275, 1593–1599 (1997).
3. Rescorla, R. A. & Wagner, A. R. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. Class. Cond. II Curr. Res. Theory 2, 64–99 (1972).
4. Sutton, R. S. & Barto, A. G. A temporal-difference model of classical conditioning. In: Proceedings of the Ninth Annual Conference of the Cognitive Science Society. 355–378 (1987).
5. Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction (MIT Press, 1998).
Cited by
43 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献