Abstract
AbstractThe neuroscience of perception has recently been revolutionized with an integrative modeling approach in which computation, brain function, and behavior are linked across many datasets and many computational models. By revealing trends across models, this approach yields novel insights into cognitive and neural mechanisms in the target domain. We here present a first systematic study taking this approach to higher-level cognition: human language processing, our species’ signature cognitive skill. We find that the most powerful ‘transformer’ models predict nearly 100% of explainable variance in neural responses to sentences and generalize across different datasets and imaging modalities (fMRI, ECoG). Models’ neural fits (‘brain score’) and fits to behavioral responses are both strongly correlated with model accuracy on the next-word prediction task (but not other language tasks). Model architecture appears to substantially contribute to neural fit. These results provide computationally explicit evidence that predictive processing fundamentally shapes the language comprehension mechanisms in the human brain.SignificanceLanguage is a quintessentially human ability. Research has long probed the functional architecture of language processing in the mind and brain using diverse brain imaging, behavioral, and computational modeling approaches. However, adequate neurally mechanistic accounts of how meaning might be extracted from language are sorely lacking. Here, we report an important first step toward addressing this gap by connecting recent artificial neural networks from machine learning to human recordings during language processing. We find that the most powerful models predict neural and behavioral responses across different datasets up to noise levels. Models that perform better at predicting the next word in a sequence also better predict brain measurements – providing computationally explicit evidence that predictive processing fundamentally shapes the language comprehension mechanisms in the human brain.
Publisher
Cold Spring Harbor Laboratory
Reference178 articles.
1. Incremental interpretation at verbs: restricting the domain of subsequent reference
2. Arora, S. , Cohen, N. , & Hazan, E. (2018). On the optimization of deep networks: Implicit acceleration by overparameterization. International Conference on Machine Learning (ICML), 372–389. http://arxiv.org/abs/1802.06509
3. Bao, P. , She, L. , McGill, M. , & Tsao, D. Y. (2020). A map of object space in primate inferotemporal cortex. Nature, 1–6. https://doi.org/10.1038/s41586-020-2350-5
4. Bashivan, P. , Kar, K. , & DiCarlo, J. J. (2019). Neural population control via deep image synthesis. Science, 364(6439). https://doi.org/10.1126/science.aav9436
5. Canonical Microcircuits for Predictive Coding
Cited by
39 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献