Attention weights accurately predict language representations in the brain-Reference-Cited by-同舟云学术

Attention weights accurately predict language representations in the brain

Published:2022-12-07 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Lamarre Mathis,Chen Catherine,Deniz Fatma

Abstract

AbstractIn Transformer-based language models (LMs) the attention mechanism converts token embeddings into contextual embeddings that incorporate information from neighboring words. The resulting contextual hidden state embeddings have enabled highly accurate models of brain responses, suggesting that the attention mechanism constructs contextual embeddings that carry information reflected in language-related brain representations. However, it is unclear whether the attention weights that are used to integrate information across words are themselves related to language representations in the brain. To address this question we analyzed functional magnetic resonance imaging (fMRI) recordings of participants reading English language narratives. We provided the narrative text as input to two LMs (BERT and GPT-2) and extracted their corresponding attention weights. We then used encoding models to determine how well attention weights can predict recorded brain responses. We find that attention weights accurately predict brain responses in much of the frontal and temporal cortices. Our results suggest that the attention mechanism itself carries information that is reflected in brain representations. Moreover, these results indicate cortical areas in which context integration may occur.

Publisher

Cold Spring Harbor Laboratory

Reference40 articles.

1. Samira Abnar and Willem Zuidema . 2020. Quantifying attention flow in transformers. arXiv preprint arXiv:2005.00928.

2. Charlotte Caucheteux , Alexandre Gramfort , and Jean-Rémi King . 2021. Model-based analysis of brain activity reveals the hierarchy of language in 305 subjects. arXiv preprint arXiv:2110.06078.

3. The visual word form area (vwfa) is part of both language and attention circuitry;Nature communications,2019

4. Kevin Clark , Urvashi Khandelwal , Omer Levy , and Christopher D Manning . 2019. What does bert look at? an analysis of bert’s attention. arXiv preprint arXiv:1906.04341.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The cortical representation of language timescales is shared between reading and listening;Communications Biology;2024-03-07