A distributional code for value in dopamine-based reinforcement learning-Reference-Cited by-同舟云学术

A distributional code for value in dopamine-based reinforcement learning

Published:2020-01-15 Issue:7792 Volume:577 Page:671-675
ISSN:0028-0836
Container-title:Nature
language:en
Short-container-title:Nature

Author:

Dabney Will,Kurth-Nelson Zeb,Uchida Naoshige,Starkweather Clara Kwon,Hassabis Demis,Munos Rémi,Botvinick Matthew

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

http://www.nature.com/articles/s41586-019-1924-6.pdf

Reference42 articles.

1. Schultz, W., Stauffer, W. R. & Lak, A. The phasic dopamine signal maturing: from reward via behavioural activation to formal economic utility. Curr. Opin. Neurobiol. 43, 139–148 (2017).

2. Glimcher, P. W. Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis. Proc. Natl Acad. Sci. USA 108, 15647–15654 (2011).

3. Watabe-Uchida, M., Eshel, N. & Uchida, N. Neural circuitry of reward prediction error. Annu. Rev. Neurosci. 40, 373–394 (2017).

4. Morimura, T., Sugiyama, M., Kashima, H., Hachiya, H. & Tanaka, T. Parametric return density estimation for reinforcement learning. In Proc. 26th Conference on Uncertainty in Artificial Intelligence (eds Grunwald, P. & Spirtes, P.) http://dl.acm.org/citation.cfm?id=3023549.3023592 (2010).

5. Bellemare, M. G., Dabney, W., & Munos, R. A distributional perspective on reinforcement learning. In International Conference on Machine Learning (eds Precup, D. & The, Y. W.) 449–458 (2017).

Cited by 255 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dopamine transients follow a striatal gradient of reward time horizons;Nature Neuroscience;2024-02-06

2. Prediction error in dopamine neurons during associative learning;Neuroscience Research;2024-02

3. Artificial Intelligence in Neuroscience;Neuroscience for Neurosurgeons;2024-01-25

4. Emergence and Causality in Complex Systems: A Survey of Causal Emergence and Related Quantitative Studies;Entropy;2024-01-24

5. Distributional reinforcement learning in prefrontal cortex;Nature Neuroscience;2024-01-10