Decoding Word Embeddings with Brain-Based Semantic Features

Published:2021-11 Issue:3 Volume:47 Page:663-698
ISSN:0891-2017
Container-title:Computational Linguistics
language:en

Author:

Chersoni Emmanuele¹, Santus Enrico², Huang Chu-Ren³, Lenci Alessandro⁴

Affiliation:

1. The Hong Kong Polytechnic University, Department of Chinese and Bilingual Studies. emmanuele.chersoni@polyu.edu.hk

2. MIT Computer Science and Artificial Intelligence Laboratory. esantus@mit.edu

3. The Hong Kong Polytechnic University, Department of Chinese and Bilingual Studies. churen.huang@polyu.edu.hk

4. University of Pisa, Department of Philology, Literature and Linguistics. alessandro.lenci@unipi.it

Abstract

Word embeddings are vectorial semantic representations built with either counting or predicting techniques aimed at capturing shades of meaning from word co-occurrences. Since their introduction, these representations have been criticized for lacking interpretable dimensions. This property of word embeddings limits our understanding of the semantic features they actually encode. Moreover, it contributes to the “black box” nature of the tasks in which they are used, since the reasons for word embedding performance often remain opaque to humans. In this contribution, we explore the semantic properties encoded in word embeddings by mapping them onto interpretable vectors, consisting of explicit and neurobiologically motivated semantic features (Binder et al. 2016). Our exploration takes into account different types of embeddings, including factorized count vectors and predict models (Skip-Gram, GloVe, etc.), as well as the most recent contextualized representations (i.e., ELMo and BERT). In our analysis, we first evaluate the quality of the mapping in a retrieval task, then we shed light on the semantic features that are better encoded in each embedding type. A large number of probing tasks is finally set to assess how the original and the mapped embeddings perform in discriminating semantic categories. For each probing task, we identify the most relevant semantic features and we show that there is a correlation between the embedding performance and how they encode those features. This study sets itself as a step forward in understanding which aspects of meaning are captured by vector spaces, by proposing a new and simple method to carve human-interpretable semantic representations from distributional vectors.
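The mapping and retrieval evaluation described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: the abstract does not name the regression method, so a ridge-regularized linear map is swapped in; the data are synthetic; and the dimensions, the helper names (`fit_linear_map`, `top_k_accuracy`), and the train/test split are hypothetical.

```python
import numpy as np


def fit_linear_map(X, Y, alpha=1.0):
    """Ridge solution W minimizing ||XW - Y||^2 + alpha * ||W||^2.

    X: (n_words, emb_dim) embeddings; Y: (n_words, n_feats) feature vectors.
    """
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(d), X.T @ Y)


def cosine_matrix(A, B):
    """Pairwise cosine similarity between the rows of A and the rows of B."""
    A = A / np.linalg.norm(A, axis=1, keepdims=True)
    B = B / np.linalg.norm(B, axis=1, keepdims=True)
    return A @ B.T


def top_k_accuracy(pred, gold, k=1):
    """Share of predicted vectors whose own gold vector ranks in the top k."""
    sims = cosine_matrix(pred, gold)
    correct = np.diag(sims)
    # Rank = number of gold vectors more similar than the correct one.
    ranks = (sims > correct[:, None]).sum(axis=1)
    return float((ranks < k).mean())


# Synthetic stand-ins (assumption): random "embeddings" and feature vectors
# generated from a hidden linear map plus a little noise.
rng = np.random.default_rng(0)
n_words, emb_dim, n_feats = 200, 50, 65
X = rng.normal(size=(n_words, emb_dim))
true_W = rng.normal(size=(emb_dim, n_feats))
Y = X @ true_W + 0.1 * rng.normal(size=(n_words, n_feats))

# Hold out the last 50 words to evaluate retrieval on unseen items.
train, test = np.arange(150), np.arange(150, 200)
W = fit_linear_map(X[train], Y[train], alpha=1.0)
pred = X[test] @ W
acc = top_k_accuracy(pred, Y[test], k=1)
print(f"top-1 retrieval accuracy on held-out words: {acc:.2f}")
```

Because each column of `W` corresponds to one named feature, per-feature quality can be inspected in the same setup by correlating each column of `pred` with the matching column of the gold matrix.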

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Language and Linguistics

Link

https://direct.mit.edu/coli/article-pdf/47/3/663/1971848/coli_a_00412.pdf

Reference 129 articles.

1. Experiential, distributional and dependency-based word embeddings have complementary roles in decoding brain activity;Abnar,2018

2. Fine-grained analysis of sentence embeddings using auxiliary prediction tasks;Adi,2017

3. Predicting neural activity patterns associated with sentences using a neurobiologically motivated model of semantic representation;Anderson;Cerebral Cortex,2016

4. Multiple regions of a cortical network commonly encode the meaning of words in multiple grammatical positions of read sentences;Anderson;Cerebral Cortex,2018

5. Neural activation semantic models: Computational lexical semantic models of localized neural activations;Athanasiou,2018

Cited by 14 articles.

1. Predicting the next sentence (not word) in large language models: What model-brain alignment tells us about discourse comprehension;Science Advances;2024-05-24

2. Probing the Representational Structure of Regular Polysemy via Sense Analogy Questions: Insights from Contextual Word Vectors;Cognitive Science;2024-03

3. Modeling Brain Representations of Words' Concreteness in Context Using GPT‐2 and Human Ratings;Cognitive Science;2023-12

4. The good, the bad, and the ambivalent: Extrapolating affective values for 38,000+ Chinese words via a computational model;Behavior Research Methods;2023-11-15

5. Perceptional and actional enrichment for metaphor detection with sensorimotor norms;Natural Language Engineering;2023-09-20