Improved CCG Parsing with Semi-supervised Supertagging-Reference-Cited by-同舟云学术

Improved CCG Parsing with Semi-supervised Supertagging

Published:2014-12 Issue: Volume:2 Page:327-338
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:TACL

Author:

Lewis Mike¹,Steedman Mark¹

Affiliation:

1. School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK,

Abstract

Current supervised parsers are limited by the size of their labelled training data, making improving them with unlabelled data an important goal. We show how a state-of-the-art CCG parser can be enhanced, by predicting lexical categories using unsupervised vector-space embeddings of words. The use of word embeddings enables our model to better generalize from the labelled data, and allows us to accurately assign lexical categories without depending on a POS-tagger. Our approach leads to substantial improvements in dependency parsing results over the standard supervised CCG parser when evaluated on Wall Street Journal (0.8%), Wikipedia (1.8%) and biomedical (3.4%) text. We compare the performance of two recently proposed approaches for classification using a wide variety of word embeddings. We also give a detailed error analysis demonstrating where using embeddings outperforms traditional feature sets, and showing how including POS features can decrease accuracy.

Publisher

MIT Press - Journals

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/tacl_a_00186

Reference8 articles.

1. Wide-Coverage Efficient Statistical Parsing with CCG and Log-Linear Models

2. CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Building a supertagger for Spanish HPSG;Computer Speech & Language;2019-03

2. CCG supertagging via Bidirectional LSTM-CRF neural architecture;Neurocomputing;2018-03

3. Shortcut Sequence Tagging;Natural Language Processing and Chinese Computing;2018

4. Parsing with Traces: An O(n4) Algorithm and a Structural Representation;Transactions of the Association for Computational Linguistics;2017-12

5. CCG supertagging with bidirectional long short-term memory networks;Natural Language Engineering;2017-09-04