Affiliation:
1. Massachusetts Institute of Technology, Computer Science and Artificial Intelligence Laboratory (CSAIL), the Stata Center, Building 32, 32 Vassar Street, Cambridge, MA 02139.
2. Massachusetts Institute of Technology
Abstract
This article considers approaches that rerank the output of an existing probabilistic parser. The base parser produces a set of candidate parses for each input sentence, with associated probabilities that define an initial ranking of these parses. A second model then attempts to improve upon this initial ranking, using additional features of the tree as evidence. The strength of our approach is that it allows a tree to be represented as an arbitrary set of features, without concerns about how these features interact or overlap and without the need to define a derivation or a generative model which takes these features into account. We introduce a new method for the reranking task, based on the boosting approach to ranking problems described in Freund et al. (1998). We apply the boosting method to parsing the Wall Street Journal treebank. The method combined the log-likelihood under a baseline model (that of Collins [1999]) with evidence from an additional 500,000 features over parse trees that were not included in the original model. The new model achieved 89.75% F-measure, a 13% relative decrease in F-measure error over the baseline model's score of 88.2%. The article also introduces a new algorithm for the boosting approach which takes advantage of the sparsity of the feature space in the parsing data. Experiments show significant efficiency gains for the new algorithm over the obvious implementation of the boosting approach. We argue that the method is an appealing alternative, in terms of both simplicity and efficiency, to work on feature selection methods within log-linear (maximum-entropy) models. Although the experiments in this article are on natural language parsing, the approach should be applicable to many other natural language processing (NLP) problems which are naturally framed as ranking tasks, for example, speech recognition, machine translation, or natural language generation.
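The following is a minimal sketch, not the authors' code, of the reranking scheme the abstract describes: each candidate parse is scored by combining the baseline model's log-likelihood with a weighted sum over sparse indicator features of the tree. The feature names, weights, and the candidates data structure below are illustrative assumptions.

    # Sketch of feature-based reranking over a base parser's n-best output.
    # All feature names and numbers here are hypothetical, for illustration only.

    def rerank(candidates, weights, w0=1.0):
        """candidates: list of (log_prob, feature_set) pairs for one sentence.
        weights: dict mapping feature name -> learned weight.
        w0: weight on the baseline log-likelihood.
        Returns the index of the highest-scoring candidate parse."""
        def score(log_prob, features):
            return w0 * log_prob + sum(weights.get(f, 0.0) for f in features)
        return max(range(len(candidates)),
                   key=lambda i: score(*candidates[i]))

    # Example usage with two made-up candidate parses:
    candidates = [(-42.7, {"rule:NP->DT_NN", "bigram:saw_with"}),
                  (-43.1, {"rule:NP->DT_NN_PP"})]
    weights = {"rule:NP->DT_NN_PP": 0.8}
    best = rerank(candidates, weights)  # index of the reranked best parse

In the article the feature weights are not set by hand but learned by the boosting algorithm, which exploits the sparsity of the feature space when updating them.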
Subject
Artificial Intelligence, Computer Science Applications, Linguistics and Language, Language and Linguistics
Cited by
79 articles.