Predictive Coding or Just Feature Discovery? An Alternative Account of Why Language Models Fit Brain Data

Published: 2023-02-24
Page: 1-16
ISSN: 2641-4368
Container-title: Neurobiology of Language
Language: en

Author:

Antonello Richard¹, Huth Alexander¹

Affiliation:

1. Department of Computer Science, University of Texas at Austin, Austin, TX, USA

Abstract

Many recent studies have shown that representations drawn from neural network language models are extremely effective at predicting brain responses to natural language. But why do these models work so well? One proposed explanation is that language models and brains are similar because they have the same objective: to predict upcoming words before they are perceived. This explanation is attractive because it lends support to the popular theory of predictive coding. We provide several analyses that cast doubt on this claim. First, we show that the ability to predict future words does not uniquely (or even best) explain why some representations are a better match to the brain than others. Second, we show that within a language model, representations that are best at predicting future words are strictly worse brain models than other representations. Finally, we argue in favor of an alternative explanation for the success of language models in neuroscience: These models are effective at predicting brain responses because they generally capture a wide variety of linguistic phenomena.

Funder

Burroughs Wellcome Fund

Intel Corporation

National Institute on Deafness and Other Communication Disorders

Publisher

MIT Press

Subject

Neurology, Linguistics and Language

Link

https://direct.mit.edu/nol/article-pdf/doi/10.1162/nol_a_00087/2072549/nol_a_00087.pdf

Cited by 14 articles.

1. Large-scale benchmark yields no evidence that language model surprisal explains syntactic disambiguation difficulty; Journal of Memory and Language; 2024-08

2. Shared functional specialization in transformer-based language models and the human brain; Nature Communications; 2024-06-29

3. Constraint satisfaction in large language models; Language, Cognition and Neuroscience; 2024-06-17

4. Language Models Outperform Cloze Predictability in a Cognitive Model of Reading; 2024-04-30

5. Incremental Accumulation of Linguistic Context in Artificial and Biological Neural Networks; 2024-01-16