Author:
Jason Baldridge, Nicholas Asher, Julie Hunter
Abstract
Predicting discourse structure in naturally occurring texts and dialogs is challenging and computationally intensive. Attempts to construct hand-built systems have run into problems both in specifying the required knowledge and in performing the necessary computations efficiently. Data-driven approaches have recently been shown to handle challenging aspects of discourse without relying on large amounts of fine-grained semantic detail, but they require annotated material for training. We describe our effort to annotate Segmented Discourse Representation Structures on Wall Street Journal texts, arguing that graph-based representations are necessary to adequately capture the dependencies found in the data. We then explore two data-driven parsing strategies for recovering discourse structures. We show that the generative PCFG model of Baldridge & Lascarides (2005b) is inherently limited by its inability to incorporate new features when learning from small data sets, and we show how recent developments in dependency parsing and discriminative learning can be used to overcome this problem and thereby improve parsing accuracy. We report results from exploratory experiments on Verbmobil dialogs and on our annotated newswire texts; these results suggest that the methods do indeed enhance performance and have the potential for significant further improvement through richer feature sets.
Subject
Linguistics and Language, Language and Linguistics
Cited by
8 articles.