Language processing in brains and deep neural networks: computational convergence and its limits-Reference-Cited by-同舟云学术

Language processing in brains and deep neural networks: computational convergence and its limits

Published:2020-07-04 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Caucheteux Charlotte,KING Jean-Remi

Abstract

Deep Learning has recently led to major advances in natural language processing. Do these models process sentences similarly to humans, and is this similarity driven by specific principles? Using a variety of artificial neural networks, trained on image classification, word embedding, or language modeling, we evaluate whether their architectural and functional properties lead them to generate activations linearly comparable to those of 102 human brains measured with functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG). We show that image, word and contextualized word embeddings separate the hierarchical levels of language processing in the brain. Critically, we compare \NNetworks{} embeddings in their ability to linearly map onto these brain responses. The results show that (1) the position of the layer in the network and (2) the ability of the network to accurately predict words from context are the main factors responsible for the emergence of brain-like representations in artificial neural networks. Together, these results show how perceptual, lexical and compositional representations precisely unfold within each cortical region and contribute to uncovering the governing principles of language processing in brains and algorithms.

Publisher

Cold Spring Harbor Laboratory

Reference70 articles.

1. S. Abnar , R. Ahmed , M. Mijnheer , and W. H. Zuidema . Experiential, distributional and dependency-based word embeddings have complementary roles in decoding brain activity. CoRR, abs/1711.09285, 2017. URL http://arxiv.org/abs/1711.09285.

2. Multiple Regions of a Cortical Network Commonly Encode the Meaning of Words in Multiple Grammatical Positions of Read Sentences

3. N. Athanasiou , E. Iosif , and A. Potamianos . Neural activation semantic models: Computational lexical semantic models of localized neural activations. In Proceedings of the 27th International Conference on Computational Linguistics, pages 2867–2878, Santa Fe, New Mexico, USA, Aug. 2018. Association for Computational Linguistics. URL https://www.aclweb.org/anthology/C18-1243.

4. J. Baek , G. Kim , J. Lee , S. Park , D. Han , S. Yun , S. J. Oh , and H. Lee . What is wrong with scene text recognition model comparisons? dataset and model analysis. In Proceedings of the IEEE International Conference on Computer Vision, pages 4715–4723, 2019.

5. Magnetoencephalography for brain electrophysiology and imaging

Cited by 30 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Design and evaluation of a global workspace agent embodied in a realistic multimodal environment;Frontiers in Computational Neuroscience;2024-06-14

2. AI as a Model for the Brain;Artificial Intelligence and Brain Research;2024

3. Surprisal From Language Models Can Predict ERPs in Processing Predicate-Argument Structures Only if Enriched by an Agent Preference Principle;Neurobiology of Language;2023-12-18

4. Robust multi-domain descriptive text classification leveraging conventional and hybrid deep learning models;International Journal of Information Technology;2023-10-25

5. Multiplicative processing in the modeling of cognitive activities in large neural networks;Biophysical Reviews;2023-06-22