Disambiguation of newly derived nominalizations in context: A Distributional Semantics approach-Reference-Cited by-同舟云学术

Disambiguation of newly derived nominalizations in context: A Distributional Semantics approach

Published:2018-11 Issue:3 Volume:11 Page:277-312
ISSN:1750-1245
Container-title:Word Structure
language:en
Short-container-title:Word Structure

Author:

Lapesa Gabriella¹,Kawaletz Lea²,Plag Ingo²,Andreou Marios²,Kisselew Max¹,Padó Sebastian¹

Affiliation:

1. University of Stuttgart, Germany

2. Heinrich-Heine-Universität Düsseldorf, Germany

Abstract

One of the central problems in the semantics of derived words is polysemy (see, for example, the recent contributions by Lieber 2016 and Plag et al. 2018 ). In this paper, we tackle the problem of disambiguating newly derived words in context by applying Distributional Semantics ( Firth 1957 ) to deverbal -ment nominalizations (e.g. bedragglement, emplacement). We collected a dataset containing contexts of low frequency deverbal -ment nominalizations (55 types, 406 tokens, see Appendix B) extracted from large corpora such as the Corpus of Contemporary American English. We chose low frequency derivatives because high frequency formations are often lexicalized and thus tend to not exhibit the kind of polysemous readings we are interested in. Furthermore, disambiguating low-frequency words presents an especially difficult task because there is little to no prior knowledge about these words from which their semantic properties can be extrapolated. The data was manually annotated according to eventive vs. non-eventive interpretations, allowing also an ambiguous label in those cases where the context did not disambiguate. Our question then was to what extent, and under which conditions, context-derived representations such as those of Distributional Semantics can be successfully employed in the disambiguation of low-frequency derivatives. Our results show that, first, our models are able to distinguish between eventive and non-eventive readings with some success. Second, very small context windows are sufficient to find the intended interpretation in the majority of cases. Third, ambiguous instances tend to be classified as events. Fourth, the performance of the classifier differed for different subcategories of nouns, with non-eventive derivatives being harder to classify correctly. We present indirect evidence that this is due to the semantic similarity of abstract non-eventive nouns to eventive nouns. Overall, this paper demonstrates that distributional semantic models can be fruitfully employed for the disambiguation of low frequency words in spite of the scarcity of available contextual information. 1

Publisher

Edinburgh University Press

Subject

Linguistics and Language,Language and Linguistics

Reference59 articles.

1. Lexical Meaning in Context

2. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Instances of bias: the gendered semantics of generic masculines in German revealed by instance vectors;Zeitschrift für Sprachwissenschaft;2024-08-06

2. Chapter 3. Actional nominalization in Present-Day English in the light of the Referenced Index of Competition;Linguistik Aktuell/Linguistics Today;2024-05-15

3. Systematic mappings of sound to meaning: A theoretical review;Psychonomic Bulletin & Review;2023-10-06

4. The Suffix ‑ment between the Available and the Unavailable;Anglia;2023-06-01

5. Chapter 7. Paradigmatic aspects of deverbal noun conversion in English;Paradigms in Word Formation;2022-09-15