Contextualized word senses: from attention to compositionality

Published: 2023-11-30
Volume: 9, Issue: 1, Pages: 191-203
ISSN: 2199-174X
Journal: Linguistics Vanguard
Language: en

Author:

Pablo Gamallo¹

Affiliation:

1. Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS), Universidade de Santiago de Compostela, Galiza, Spain

Abstract

The neural architectures of language models are becoming increasingly complex, especially that of Transformers, based on the attention mechanism. Although their application to numerous natural language processing tasks has proven to be very fruitful, they continue to be models with little or no interpretability and explainability. One of the tasks for which they are best suited is the encoding of the contextual sense of words using contextualized embeddings. In this paper we propose a transparent, interpretable, and linguistically motivated strategy for encoding the contextual sense of words by modeling semantic compositionality. Particular attention is given to dependency relations and semantic notions such as selection preferences and paradigmatic classes. A partial implementation of the proposed model is carried out and compared with Transformer-based architectures for a given semantic task, namely the similarity calculation of word senses in context. The results obtained show that it is possible to be competitive with linguistically motivated models instead of using the black boxes underlying complex neural architectures.

Funder

Consellería de Cultura, Educación e Ordenación Universitaria

Publisher

Walter de Gruyter GmbH

Subject

Linguistics and Language, Language and Linguistics

Link

https://www.degruyter.com/document/doi/10.1515/lingvan-2022-0125/pdf

References (53 articles)

1. Asher, Nicholas, Tim Van de Cruys, Antoine Bride & Márta Abrusán. 2016. Integrating type theory and distributional semantics: A case study on adjective–noun compositions. Computational Linguistics 42(4). 703–725. https://doi.org/10.1162/COLI_a_00264.

2. Baroni, Marco. 2013. Composition in distributional semantics. Language and Linguistics Compass 7. 511–522. https://doi.org/10.1111/lnc3.12050.

3. Baroni, Marco. 2020. Linguistic generalization and compositionality in modern artificial neural networks. Philosophical Transactions of the Royal Society B 375. 1–7. https://doi.org/10.1098/rstb.2019.0307.

4. Baroni, Marco, Raffaella Bernardi & Roberto Zamparelli. 2014. Frege in space: A program for compositional distributional semantics. Linguistic Issues in Language Technology (LiLT) 9. 241–346. https://doi.org/10.33011/lilt.v9i.1321.

5. Baroni, Marco & Roberto Zamparelli. 2010. Nouns are vectors, adjectives are matrices: Representing adjective-noun constructions in semantic space. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 1183–1193. Cambridge, MA: Association for Computational Linguistics. Available at: https://aclanthology.org/D10-1115.