Measuring Word Meaning in Context-Reference-Cited by-同舟云学术

Measuring Word Meaning in Context

Published:2013-09 Issue:3 Volume:39 Page:511-554
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:Computational Linguistics

Author:

Erk Katrin¹,McCarthy Diana²,Gaylord Nicholas¹

Affiliation:

1. University of Texas at Austin

2. University of Cambridge

Abstract

Word sense disambiguation (WSD) is an old and important task in computational linguistics that still remains challenging, to machines as well as to human annotators. Recently there have been several proposals for representing word meaning in context that diverge from the traditional use of a single best sense for each occurrence. They represent word meaning in context through multiple paraphrases, as points in vector space, or as distributions over latent senses. New methods of evaluating and comparing these different representations are needed. In this paper we propose two novel annotation schemes that characterize word meaning in context in a graded fashion. In WSsim annotation, the applicability of each dictionary sense is rated on an ordinal scale. Usim annotation directly rates the similarity of pairs of usages of the same lemma, again on a scale. We find that the novel annotation schemes show good inter-annotator agreement, as well as a strong correlation with traditional single-sense annotation and with annotation of multiple lexical paraphrases. Annotators make use of the whole ordinal scale, and give very fine-grained judgments that “mix and match” senses for each individual usage. We also find that the Usim ratings obey the triangle inequality, justifying models that treat usage similarity as metric. There has recently been much work on grouping senses into coarse-grained groups. We demonstrate that graded WSsim and Usim ratings can be used to analyze existing coarse-grained sense groupings to identify sense groups that may not match intuitions of untrained native speakers. In the course of the comparison, we also show that the WSsim ratings are not subsumed by any static sense grouping.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_00142

Reference86 articles.

1. Word Sense Disambiguation

2. Proceedings of the 4th International Workshop on Semantic Evaluations - SemEval '07

3. Choosing sense distinctions for WSD

4. Brown, Susan. 2010. Finding Meaning: Sense Inventories for Improved Word Sense Disambiguation. Ph.D. thesis, University of Colorado at Boulder.

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations;Transactions of the Association for Computational Linguistics;2024

2. Polysemy - Evidence from Linguistics, Behavioural Science and Contextualised Language Models;Computational Linguistics;2023-12-15

3. Truth be told: a corpus-based study of the cross-linguistic colexification of representational and (inter)subjective meanings;Corpus Linguistics and Linguistic Theory;2023-11-01

4. CONcreTEXT norms: Concreteness ratings for Italian and English words in context;PLOS ONE;2023-10-20

5. From Word Types to Tokens and Back: A Survey of Approaches to Word Meaning Representation and Interpretation;Computational Linguistics;2023-03-14