Short-term Hebbian learning can implement transformer-like attention


Ellwood Ian T.


Transformers have revolutionized machine learning models of language and vision, but their connection with neuroscience remains tenuous. Built from attention layers, they require a mass comparison of queries and keys that is difficult to perform using traditional neural circuits. Here, we show that neurons can implement attention-like computations using short-term, Hebbian synaptic potentiation. We call our mechanism the match-and-control principle. It proposes that when activity in an axon is synchronous, or matched, with the somatic activity of a neuron that it synapses onto, the synapse can be briefly and strongly potentiated, allowing the axon to take over, or control, the activity of the downstream neuron for a short time. In our scheme, the keys and queries are represented as spike trains, and comparisons between the two are performed in individual spines, allowing for hundreds of key comparisons per query and roughly as many keys and queries as there are neurons in the network.
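
The match-and-control idea can be illustrated with a toy sketch. This is not the model from the paper: it assumes binary spike trains in discrete time bins, measures the "match" at each synapse as the number of bins in which the axon (key) and the soma (query) spike together, and assumes that potentiation grows exponentially with that match. The names `spike_overlap`, `match_and_control`, `gain`, and the scalar `values` carried by each axon are all hypothetical choices made for illustration.

```python
import math
import random


def spike_overlap(key_train, query_train):
    """Count time bins in which the axon (key) and soma (query) both spike."""
    return sum(k & q for k, q in zip(key_train, query_train))


def match_and_control(query_train, key_trains, values, gain=1.0):
    """Toy attention readout (an assumption, not the paper's circuit model).

    Each synapse compares its key spike train with the query spike train.
    Potentiation is assumed to rise exponentially with the overlap, so the
    best-matched axon dominates ("controls") the downstream output.
    """
    overlaps = [spike_overlap(k, query_train) for k in key_trains]
    peak = max(overlaps)  # subtract the peak for numerical stability
    potentiation = [math.exp(gain * (o - peak)) for o in overlaps]
    total = sum(potentiation)
    weights = [p / total for p in potentiation]
    # Downstream activity: potentiation-weighted mix of what each axon carries.
    output = sum(w * v for w, v in zip(weights, values))
    return weights, output


def random_train(n_bins, rate, rng):
    """Bernoulli spike train: 1 = spike, 0 = no spike in that bin."""
    return [1 if rng.random() < rate else 0 for _ in range(n_bins)]


rng = random.Random(0)
n_bins = 200
key_trains = [random_train(n_bins, 0.2, rng) for _ in range(5)]
values = [0.0, 1.0, 2.0, 3.0, 4.0]  # hypothetical signal carried by each axon

# The query is synchronous with key 3, so that synapse should win.
query_train = list(key_trains[3])
weights, output = match_and_control(query_train, key_trains, values)
print([round(w, 3) for w in weights], round(output, 3))
```

Normalizing the exponentiated overlaps makes the readout formally resemble softmax attention, with spike-train coincidence standing in for the query-key dot product. Whether a biological synapse normalizes in this way is left open here; the normalization is a convenience of the sketch.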


Cold Spring Harbor Laboratory







