Distributional Memory: A General Framework for Corpus-Based Semantics-Reference-Cited by-同舟云学术

Distributional Memory: A General Framework for Corpus-Based Semantics

Published:2010-12 Issue:4 Volume:36 Page:673-721
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:Computational Linguistics

Author:

Baroni Marco¹,Lenci Alessandro²

Affiliation:

1. University of Trento

2. University of Pisa

Abstract

Research into corpus-based semantics has focused on the development of ad hoc models that treat single tasks, or sets of closely related tasks, as unrelated challenges to be tackled by extracting different kinds of distributional information from the corpus. As an alternative to this “one task, one model” approach, the Distributional Memory framework extracts distributional information once and for all from the corpus, in the form of a set of weighted word-link-word tuples arranged into a third-order tensor. Different matrices are then generated from the tensor, and their rows and columns constitute natural spaces to deal with different semantic problems. In this way, the same distributional information can be shared across tasks such as modeling word similarity judgments, discovering synonyms, concept categorization, predicting selectional preferences of verbs, solving analogy problems, classifying relations between word pairs, harvesting qualia structures with patterns or example pairs, predicting the typical properties of concepts, and classifying verbs into alternation classes. Extensive empirical testing in all these domains shows that a Distributional Memory implementation performs competitively against task-specific algorithms recently reported in the literature for the same tasks, and against our implementations of several state-of-the-art methods. The Distributional Memory approach is thus shown to be tenable despite the constraints imposed by its multi-purpose nature.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/coli_a_00016

Reference34 articles.

1. Quantum aspects of semantic analysis and symbolic artificial intelligence

2. Prepositions in Applications: A Survey and Introduction to the Special Issue

3. Strudel: A Corpus-Based Semantic Model Based on Properties and Types

Cited by 248 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Keystrokes: A practical exploration of semantic drift in timed word association tasks;PLOS ONE;2024-07-01

2. Compositionality, communication, and commitments;Synthese;2024-06-26

3. Training and evaluation of vector models for Galician;Language Resources and Evaluation;2024-06-04

4. Dissociating language and thought in large language models;Trends in Cognitive Sciences;2024-06

5. A study of concept similarity in Wikidata;Semantic Web;2024-05-14