Parsing with Traces: An O(n⁴) Algorithm and a Structural Representation

Published: 2017-12 Volume: 5 Pages: 441-454
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:TACL

Author:

Kummerfeld Jonathan K.¹, Klein Dan¹

Affiliation:

1. Computer Science Division, University of California, Berkeley, Berkeley, CA 94720, USA

Abstract

General treebank analyses are graph structured, but parsers are typically restricted to tree structures for efficiency and modeling reasons. We propose a new representation and algorithm for a class of graph structures that is flexible enough to cover almost all treebank structures, while still admitting efficient learning and inference. In particular, we consider directed, acyclic, one-endpoint-crossing graph structures, which cover most long-distance dislocation, shared argumentation, and similar tree-violating linguistic phenomena. We describe how to convert phrase structure parses, including traces, to our new representation in a reversible manner. Our dynamic program uniquely decomposes structures, is sound and complete, and covers 97.3% of the Penn English Treebank. We also implement a proof-of-concept parser that recovers a range of null elements and trace types.
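The one-endpoint-crossing property named in the abstract can be illustrated with a small check. The sketch below is not the paper's dynamic program; it is a brute-force test under the usual definition, in which two edges cross when exactly one endpoint of one lies strictly between the endpoints of the other, and a graph is one-endpoint-crossing when, for every edge, all edges crossing it share a common endpoint. The function names and the representation of edges as pairs of word positions are assumptions made for illustration, and edge direction is ignored because it does not affect crossing.

```python
def crosses(e, f):
    """True if edges e and f cross when drawn as arcs above the sentence."""
    (a, b), (c, d) = sorted(e), sorted(f)
    return a < c < b < d or c < a < d < b


def is_one_endpoint_crossing(edges):
    """Brute-force check: for each edge, all crossing edges share an endpoint.

    `edges` is an iterable of (i, j) word-position pairs (hypothetical format).
    """
    edges = [tuple(e) for e in edges]
    for e in edges:
        crossing = [f for f in edges if crosses(e, f)]
        if not crossing:
            continue
        # Intersect the endpoint sets of every edge that crosses e.
        common = set(crossing[0])
        for f in crossing[1:]:
            common &= set(f)
        if not common:
            return False
    return True


if __name__ == "__main__":
    # (0, 3) is crossed by (1, 4) and (2, 4), which share endpoint 4.
    print(is_one_endpoint_crossing([(0, 3), (1, 4), (2, 4)]))
    # (1, 4) is crossed by (0, 3) and (2, 5), which share no endpoint.
    print(is_one_endpoint_crossing([(0, 3), (1, 4), (2, 5)]))
```

This check is quadratic in the number of edges per call and only tests membership in the class; it says nothing about acyclicity or about how the paper's algorithm searches over such structures.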

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Human-Computer Interaction, Communication

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/tacl_a_00072

References: 7 articles.

1. Hierarchical Phrase-Based Translation

2. Improved CCG Parsing with Semi-supervised Supertagging

3. A Crossing-Sensitive Third-Order Factorization for Dependency Parsing

4. Finding Optimal 1-Endpoint-Crossing Trees

Cited by 1 article.

1. The Interplay Between Loss Functions and Structural Constraints in Dependency Parsing; Northern European Journal of Language Technology; 2019-12-20