Attentional Reinforcement Learning in the Brain

Published:2020-01-02 Issue:1 Volume:38 Page:49-64
ISSN:0288-3635
Container-title:New Generation Computing
language:en
Short-container-title:New Gener. Comput.

Author:

Yamakawa Hiroshi

Abstract

Recently, attention mechanisms have significantly boosted the performance of natural language processing using deep learning. An attention mechanism can select the information to be used, such as by conducting a dictionary lookup; this information is then used, for example, to select the next utterance word in a sentence. In neuroscience, the basis of the function of sequentially selecting words is considered to be the cortico-basal ganglia-thalamocortical loop. Here, we first show that the attention mechanism used in deep learning corresponds to the mechanism in which the cerebral basal ganglia suppress thalamic relay cells in the brain. Next, we demonstrate that, in neuroscience, the output of the basal ganglia is associated with the action output in the actor of reinforcement learning. Based on these, we show that the aforementioned loop can be generalized as reinforcement learning that controls the transmission of the prediction signal so as to maximize the prediction reward. We call this attentional reinforcement learning (ARL). In ARL, the actor selects the information transmission route according to the attention, and the prediction signal changes according to the context detected by the information source of the route. Hence, ARL enables flexible action selection that depends on the situation, unlike traditional reinforcement learning, wherein the actor must directly select an action.
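The "dictionary lookup" view of attention and the idea of selecting a transmission route can be illustrated with a small sketch. This is not the paper's model: `soft_lookup` is ordinary scaled dot-product attention for a single query, and `gated_route` is a hypothetical hard gate added here only to suggest how an actor might pass the prediction of one source while suppressing the others. All names, keys, values, and the candidate words are assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_lookup(query, keys, values):
    """Scaled dot-product attention for one query: a 'soft' dictionary lookup.

    Returns (weights, output), where output is the attention-weighted mix
    of the value vectors.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    output = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, output


def gated_route(weights, source_predictions):
    """Hypothetical hard gate: keep only the most-attended source's prediction."""
    best = max(range(len(weights)), key=lambda i: weights[i])
    return best, source_predictions[best]


# Toy data (illustrative only): three information sources, each with a key,
# a value, and a candidate next word it would predict.
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
values = [[10.0], [20.0], [30.0]]
candidates = ["word_a", "word_b", "word_c"]

weights, mixed = soft_lookup([4.0, 0.0], keys, values)
route, prediction = gated_route(weights, candidates)
```

In this toy setting the query aligns most strongly with the first key, so the gate opens the first route and its candidate word is the one transmitted; the soft output `mixed` is instead a blend of all three values.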

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Hardware and Architecture,Theoretical Computer Science,Software

Link

http://link.springer.com/content/pdf/10.1007/s00354-019-00081-z.pdf

Reference: 55 articles.

1. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 30, pp. 5998–6008. Curran Associates Inc, New York (2017)

2. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding (2018)

3. Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I.: Language models are unsupervised multitask learners. OpenAI Blog 1(8) (2019)

4. Crosson, B.: Subcortical functions in language: a working model. Brain Lang. 25(2), 257–292 (1985)

5. Crosson, B.A.: Subcortical Functions in Language and Memory. Guilford Press, New York (1992)

Cited by 10 articles.

1. Designing quantum multi-category classifier from the perspective of brain processing information;Machine Learning: Science and Technology;2024-09-01

2. Advanced Reinforcement Learning and Its Connections with Brain Neuroscience;Research;2023-01

3. Beyin Temelli Öğrenme Yaklaşımı Konulu Yüksek Lisans Tezlerine Yönelik Bir İçerik Analizi [A Content Analysis of Master's Theses on the Brain-Based Learning Approach];Dokuz Eylül Üniversitesi Buca Eğitim Fakültesi Dergisi;2022-06-23

4. Gated recurrence enables simple and accurate sequence prediction in stochastic, changing, and structured environments;eLife;2021-12-02