RELPRON: A Relative Clause Evaluation Data Set for Compositional Distributional Semantics

Published: 2016-12 Issue: 4 Volume: 42 Pages: 661-701
ISSN: 0891-2017
Container-title: Computational Linguistics
Language: en

Author:

Laura Rimell¹, Jean Maillard¹, Tamara Polajnar¹, Stephen Clark¹

Affiliation:

1. University of Cambridge Computer Laboratory

Abstract

This article introduces RELPRON, a large data set of subject and object relative clauses, for the evaluation of methods in compositional distributional semantics. RELPRON targets an intermediate level of grammatical complexity between content-word pairs and full sentences. The task involves matching terms, such as “wisdom,” with representative properties, such as “quality that experience teaches.” A unique feature of RELPRON is that it is built from attested properties, but without the need for them to appear in relative clause format in the source corpus. The article also presents some initial experiments on RELPRON, using a variety of composition methods including simple baselines, arithmetic operators on vectors, and finally, more complex methods in which argument-taking words are represented as tensors. The latter methods are based on the Categorial framework, which is described in detail. The results show that vector addition is difficult to beat—in line with the existing literature—but that an implementation of the Categorial framework based on the Practical Lexical Function model is able to match the performance of vector addition. The article finishes with an in-depth analysis of RELPRON, showing how results vary across subject and object relative clauses, across different head nouns, and how the methods perform on the subtasks necessary for capturing relative clause semantics, as well as providing a qualitative analysis highlighting some of the more common errors. Our hope is that the competitive results presented here, in which the best systems are on average ranking one out of every two properties correctly for a given term, will inspire new approaches to the RELPRON ranking task and other tasks based on linguistically interesting constructions.
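The ranking task and the vector-addition baseline described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the three-dimensional toy vectors, the second term and property ("plane", "device that pilot flies"), the function names, dropping the relative pronoun before composing, and the use of cosine similarity with average precision as the scoring scheme. None of it is taken from the RELPRON data or from the authors' implementation.

```python
import math

# Toy 3-dimensional word vectors (hypothetical values, for illustration only).
VECTORS = {
    "wisdom":     [0.9, 0.1, 0.2],
    "quality":    [0.7, 0.2, 0.1],
    "experience": [0.8, 0.1, 0.3],
    "teaches":    [0.6, 0.3, 0.2],
    "device":     [0.1, 0.9, 0.2],
    "pilot":      [0.2, 0.7, 0.6],
    "flies":      [0.1, 0.6, 0.8],
}

# Each property is a tuple of content words; the relative pronoun "that"
# is dropped here as a simplifying assumption.
PROPERTIES = [
    ("device", "pilot", "flies"),
    ("quality", "experience", "teaches"),
]


def compose_by_addition(words):
    """Compose a phrase vector as the element-wise sum of its word vectors."""
    return [sum(dims) for dims in zip(*(VECTORS[w] for w in words))]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_properties(term, properties):
    """Return the properties sorted by decreasing similarity to the term."""
    term_vec = VECTORS[term]
    return sorted(
        properties,
        key=lambda p: cosine(term_vec, compose_by_addition(p)),
        reverse=True,
    )


def average_precision(ranked, gold):
    """Average precision of a ranked list against a set of correct items."""
    hits, total = 0, 0.0
    for position, item in enumerate(ranked, start=1):
        if item in gold:
            hits += 1
            total += hits / position
    return total / len(gold) if gold else 0.0


ranked = rank_properties("wisdom", PROPERTIES)
gold = {("quality", "experience", "teaches")}
print(ranked[0], average_precision(ranked, gold))
```

With these toy vectors the property "quality that experience teaches" is ranked first for "wisdom". Averaging such per-term scores over all terms gives a single figure for comparing composition methods, which is one way to read the abstract's remark about ranking one out of every two properties correctly.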

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_00263

Cited by 13 articles.

1. A Survey of Classical and Quantum Sequence Models;2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS);2024-01-03

2. Quantum Natural Language Processing: A Comprehensive Survey;IEEE Access;2024

3. Ensemble Learning Based Quantum Text Classifiers;New Trends in Database and Information Systems;2023

4. Quantum Natural Language Processing: Challenges and Opportunities;Applied Sciences;2022-06-02

5. Verb Metaphoric Extension Under Semantic Strain;Cognitive Science;2022-05