Affiliation:
1. Lund University, Lund, Sweden
Abstract
Transformation-based learning (TBL) is a machine learning method for, in particular, sequential classification, invented by Eric Brill [Brill 1993b, 1995a]. It is widely used within computational linguistics and natural language processing, but surprisingly little in other areas.
TBL is a simple yet flexible paradigm, which achieves competitive or even state-of-the-art performance in several areas and does not overtrain easily. It is especially successful at catching local, fixed-distance dependencies and seamlessly exploits information from heterogeneous discrete feature types. The learned representation—an ordered list of transformation rules—is compact and efficient, with clear semantics. Individual rules are interpretable and often meaningful to humans.
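The learned representation described above can be illustrated with a minimal sketch. The code below applies an ordered list of transformation rules to a tag sequence in a toy part-of-speech setting; the rule format, tags, and example rule are illustrative assumptions, not Brill's actual templates.

```python
# Minimal sketch of applying a TBL rule list (illustrative, not Brill's
# exact formulation). Each rule is (from_tag, to_tag, predicate), where
# predicate(tags, i) inspects the local context at position i.

def apply_rules(tags, rules):
    """Apply an ordered list of transformation rules to a tag sequence."""
    for from_tag, to_tag, predicate in rules:
        # Collect firing positions first so one rule sees a
        # consistent snapshot of the sequence.
        hits = [i for i, t in enumerate(tags)
                if t == from_tag and predicate(tags, i)]
        for i in hits:
            tags[i] = to_tag
    return tags

# Hypothetical rule: retag VB as NN when the previous tag is DT
# (e.g. the noun reading of "can" in "the can").
rules = [("VB", "NN", lambda tags, i: i > 0 and tags[i - 1] == "DT")]

print(apply_rules(["DT", "VB"], rules))  # ['DT', 'NN']
```

Because the rules are applied in a fixed order, each one can correct errors left by its predecessors, which is why the list as a whole remains compact and individually readable.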
The present article offers a survey of the most important theoretical work on TBL, addressing a perceived gap in the literature. Because the method should be useful beyond computational linguistics and natural language processing, a chief aim is to provide an informal but relatively comprehensive introduction, accessible also to readers from other specialties.
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science, Theoretical Computer Science
References: 99 articles.
1. Harold Abelson and Gerald J. Sussman. 1996. Structure and Interpretation of Computer Programs. MIT Press, Cambridge, MA.
2. MITRE
3. Unsupervised learning of a rule-based Spanish Part of Speech tagger
4. Alignment link projection using transformation-based learning
5. Lalit R. Bahl. 1989. A tree-based statistical language model for natural language speech recognition. IEEE Transactions on Acoustics, Speech and Signal Processing.