Authors
Jason Baldridge, Miles Osborne
Abstract
For complex tasks such as parse selection, the creation of labelled training sets can be extremely costly. Resource-efficient schemes for creating informative labelled material must therefore be considered. We investigate the relationship between two broad strategies for reducing the amount of manual labelling necessary to train accurate parse selection models: ensemble models and active learning. We show that popular active learning methods for reducing annotation costs can be outperformed by instead using a model class which uses the available labelled data more efficiently. For this, we use a simple type of ensemble model called the Logarithmic Opinion Pool (LOP). We furthermore show that LOPs themselves can benefit from active learning. As predicted by a theoretical explanation of the predictive power of LOPs, a detailed analysis of active learning using LOPs shows that component model diversity is a strong predictor of successful LOP performance. Other contributions include a novel active learning method, a justification of our simulation studies using timing information, and cross-domain verification of our main ideas using text classification.
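As a rough illustration of the ensemble idea named in the abstract, a Logarithmic Opinion Pool is commonly defined as a normalised, weighted geometric mean of the component models' distributions. The sketch below assumes that standard definition; the function name `logarithmic_opinion_pool`, the uniform default weights, the probability floor `eps`, and the toy distributions are illustrative assumptions, not details taken from the paper.

```python
import math


def logarithmic_opinion_pool(distributions, weights=None, eps=1e-12):
    """Pool several distributions over the same candidates (e.g. parses).

    Assumption: each inner list is a probability distribution over the
    same candidates, in the same order. Names and defaults here are
    hypothetical and chosen only for illustration.
    """
    n_models = len(distributions)
    if weights is None:
        # Assumed default: every component model gets equal weight.
        weights = [1.0 / n_models] * n_models

    n_candidates = len(distributions[0])
    # A weighted sum of log-probabilities is the log of the weighted
    # geometric mean. Probabilities are floored at eps to avoid log(0).
    log_scores = [
        sum(w * math.log(max(p[i], eps)) for w, p in zip(weights, distributions))
        for i in range(n_candidates)
    ]

    # Normalise with the log-sum-exp trick for numerical stability.
    top = max(log_scores)
    unnormalised = [math.exp(s - top) for s in log_scores]
    total = sum(unnormalised)
    return [u / total for u in unnormalised]


# Toy example: two component models scoring three candidate parses.
model_a = [0.7, 0.2, 0.1]
model_b = [0.5, 0.4, 0.1]
pooled = logarithmic_opinion_pool([model_a, model_b])
best_parse = pooled.index(max(pooled))
```

Because the pool multiplies (rather than averages) the component probabilities, a candidate must be rated reasonably well by every component to score highly, which is one informal way to see why diversity among the component models matters.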
Publisher
Cambridge University Press (CUP)
Subject
Artificial Intelligence, Linguistics and Language, Language and Linguistics, Software