Affiliation:
1. Department of Computer Science, Higher Technical School of Computer Engineering, University of Vigo, 32004 Ourense, Spain
Abstract
AbstractOne of the most active areas of research in semi-supervised learning has been to study methods for constructing good ensembles of classifiers. Ensemble systems are techniques that create multiple models and then combine them to produce improved results. These systems usually produce more accurate solutions than a single model would. Specially, multi-view ensemble systems improve the accuracy of text classification because they optimize the functions to exploit different views of the same input data. However, despite being more promising than the single-view approaches, document datasets often have no natural multiple views available. This study proposes an algorithm to generate a synthetic view from a standard text dataset. The model generates a new view from the standard bag-of-words approach using an algorithm based on hidden Markov models (HMMs). To show the effectiveness of the proposed HMM-based synthetic view generation method, it has been integrated in a co-training ensemble system and tested with four text corpora: Reuters, 20 Newsgroup, TREC Genomics and OHSUMED. The results obtained are promising, showing a significant increase in the efficiency of the ensemble system compared to a single-view approach.
Funder
European Union Seventh Framework Programme
BIOCAPS
Spanish Ministry of Economy and Competitiveness
University of Vigo
Publisher
Oxford University Press (OUP)
Reference26 articles.
1. Creating ensemble of diverse maximum entropy models;Audhkhasi;IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),2012
2. Concept recognition for extracting protein interaction relations from biomedical text;Baumgartner;Genome Biology,2008
3. An ensemble approach to multi-view multi-instance learning;Cano;Knowledge-Based Systems,2017
4. LIBSVM: a library for support vector machines;Chang;ACM Transactions on Intelligent Systems and Technology,2011
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献