Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity

Published: 2005-12 Issue: 4 Volume: 31 Page: 439-475
ISSN: 0891-2017
Container-title: Computational Linguistics
Language: en
Short-container-title: Computational Linguistics

Author:

Weeds Julie¹, Weir David²

Affiliation:

1. University of Sussex

2. Department of Informatics, University of Sussex, Falmer, Brighton, BN1 9QH, UK

Abstract

Techniques that exploit knowledge of distributional similarity between words have been proposed in many areas of Natural Language Processing. For example, in language modeling, the sparse data problem can be alleviated by estimating the probabilities of unseen co-occurrences of events from the probabilities of seen co-occurrences of similar events. In other applications, distributional similarity is taken to be an approximation to semantic similarity. However, due to the wide range of potential applications and the lack of a strict definition of the concept of distributional similarity, many methods of calculating distributional similarity have been proposed or adopted. In this work, a flexible, parameterized framework for calculating distributional similarity is proposed. Within this framework, the problem of finding distributionally similar words is cast as one of co-occurrence retrieval (CR) for which precision and recall can be measured by analogy with the way they are measured in document retrieval. As will be shown, a number of popular existing measures of distributional similarity are simulated with parameter settings within the CR framework. In this article, the CR framework is then used to systematically investigate three fundamental questions concerning distributional similarity. First, is the relationship of lexical similarity necessarily symmetric, or are there advantages to be gained from considering it as an asymmetric relationship? Second, are some co-occurrences inherently more salient than others in the calculation of distributional similarity? Third, is it necessary to consider the difference in the extent to which each word occurs in each co-occurrence type? Two application-based tasks are used for evaluation: automatic thesaurus generation and pseudo-disambiguation. 
It is possible to achieve significantly better results on both these tasks by varying the parameters within the CR framework rather than using other existing distributional similarity measures; it will also be shown that any single unparameterized measure is unlikely to be able to do better on both tasks. This is due to an inherent asymmetry in lexical substitutability and therefore also in lexical distributional similarity.
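The analogy with document retrieval can be illustrated with a minimal sketch. It assumes each word is represented by the set of co-occurrence types it has been seen with, and that every type counts equally; the abstract does not give the formulas, so the function name `cr_precision_recall`, the unit weighting, and the toy data are all hypothetical and not the authors' parameterized framework.

```python
def cr_precision_recall(retriever_features, target_features):
    """Return (precision, recall) of one word's co-occurrence types
    measured against another word's co-occurrence types.

    Assumption: every co-occurrence type counts equally (unit weights).
    """
    shared = retriever_features & target_features
    # Precision: share of the retrieving word's types that the target also has.
    precision = len(shared) / len(retriever_features) if retriever_features else 0.0
    # Recall: share of the target word's types that the retrieving word covers.
    recall = len(shared) / len(target_features) if target_features else 0.0
    return precision, recall


# Hypothetical toy data: verbs each noun has been seen with.
dog_features = {"eat", "sleep", "run", "bark"}
animal_features = {"eat", "sleep", "run", "feed", "hunt", "breed", "live", "die"}

dog_vs_animal = cr_precision_recall(dog_features, animal_features)
animal_vs_dog = cr_precision_recall(animal_features, dog_features)
print(dog_vs_animal)  # (0.75, 0.375)
print(animal_vs_dog)  # (0.375, 0.75)
```

Swapping the two words swaps precision and recall, which is one simple way to see how a similarity score built from them can be asymmetric.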

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/089120105775299122

References: 10 articles.

1. Class-Based Probability Estimation Using a Semantic Hierarchy

Cited by 75 articles.

1. Talking about Migration in Times of Crisis: A Textual Analysis of Narratives by IOM and UNHCR on Migrants and Refugees; American Behavioral Scientist; 2023-06-24

2. How the concept of nature-based solutions for climate adaptation could be introduced in Master's curricula. Insights from France; Journal of Cleaner Production; 2023-04

3. Detection for Cultural Difference in Impression Using Masked Language Model; Culture and Computing; 2023

4. On-farm experimentation practices and associated farmer-researcher relationships: a systematic literature review; Agronomy for Sustainable Development; 2022-11-30

5. Distributional Measures of Semantic Abstraction; Frontiers in Artificial Intelligence; 2022-02-08