A Neural Substrate of Prediction and Reward


Wolfram Schultz (1), Peter Dayan (2), P. Read Montague (3)


1. W. Schultz is at the Institute of Physiology, University of Fribourg, CH-1700 Fribourg, Switzerland.

2. P. Dayan is in the Department of Brain and Cognitive Sciences, Center for Biological and Computational Learning, E-25 MIT, Cambridge, MA 02139, USA.

3. P. R. Montague is in the Division of Neuroscience, Center for Theoretical Neuroscience, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA.


The capacity to predict future events permits a creature to detect, model, and manipulate the causal structure of its interactions with its environment. Behavioral experiments suggest that learning is driven by changes in the expectations about future salient events such as rewards and punishments. Physiological work has recently complemented these studies by identifying dopaminergic neurons in the primate whose fluctuating output apparently signals changes or errors in the predictions of future salient and rewarding events. Taken together, these findings can be understood through quantitative theories of adaptive optimizing control.


American Association for the Advancement of Science (AAAS)










