Author:
Caucheteux Charlotte,KING Jean-Remi
Abstract
Deep Learning has recently led to major advances in natural language processing. Do these models process sentences similarly to humans, and is this similarity driven by specific principles? Using a variety of artificial neural networks, trained on image classification, word embedding, or language modeling, we evaluate whether their architectural and functional properties lead them to generate activations linearly comparable to those of 102 human brains measured with functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG). We show that image, word and contextualized word embeddings separate the hierarchical levels of language processing in the brain. Critically, we compare \NNetworks{} embeddings in their ability to linearly map onto these brain responses. The results show that (1) the position of the layer in the network and (2) the ability of the network to accurately predict words from context are the main factors responsible for the emergence of brain-like representations in artificial neural networks. Together, these results show how perceptual, lexical and compositional representations precisely unfold within each cortical region and contribute to uncovering the governing principles of language processing in brains and algorithms.
Publisher
Cold Spring Harbor Laboratory
Reference70 articles.
1. S. Abnar , R. Ahmed , M. Mijnheer , and W. H. Zuidema . Experiential, distributional and dependency-based word embeddings have complementary roles in decoding brain activity. CoRR, abs/1711.09285, 2017. URL http://arxiv.org/abs/1711.09285.
2. Multiple Regions of a Cortical Network Commonly Encode the Meaning of Words in Multiple Grammatical Positions of Read Sentences
3. N. Athanasiou , E. Iosif , and A. Potamianos . Neural activation semantic models: Computational lexical semantic models of localized neural activations. In Proceedings of the 27th International Conference on Computational Linguistics, pages 2867–2878, Santa Fe, New Mexico, USA, Aug. 2018. Association for Computational Linguistics. URL https://www.aclweb.org/anthology/C18-1243.
4. J. Baek , G. Kim , J. Lee , S. Park , D. Han , S. Yun , S. J. Oh , and H. Lee . What is wrong with scene text recognition model comparisons? dataset and model analysis. In Proceedings of the IEEE International Conference on Computer Vision, pages 4715–4723, 2019.
5. Magnetoencephalography for brain electrophysiology and imaging
Cited by
30 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献