Weighted Vote-Based Classifier Ensemble for Named Entity Recognition-Reference-Cited by-同舟云学术

Weighted Vote-Based Classifier Ensemble for Named Entity Recognition

Published:2011-06 Issue:2 Volume:10 Page:1-37
ISSN:1530-0226
Container-title:ACM Transactions on Asian Language Information Processing
language:en
Short-container-title:ACM Transactions on Asian Language Information Processing

Author:

Ekbal Asif¹,Saha Sriparna¹

Affiliation:

1. Indian Institute of Technology

Abstract

In this article, we report the search capability of Genetic Algorithm (GA) to construct a weighted vote-based classifier ensemble for Named Entity Recognition (NER). Our underlying assumption is that the reliability of predictions of each classifier differs among the various named entity (NE) classes. Thus, it is necessary to quantify the amount of voting of a particular classifier for a particular output class. Here, an attempt is made to determine the appropriate weights of voting for each class in each classifier using GA. The proposed technique is evaluated for four leading Indian languages, namely Bengali, Hindi, Telugu, and Oriya, which are all resource-poor in nature. Evaluation results yield the recall, precision and F-measure values of 92.08%, 92.22%, and 92.15%, respectively for Bengali; 96.07%, 88.63%, and 92.20%, respectively for Hindi; 78.82%, 91.26%, and 84.59%, respectively for Telugu; and 88.56%, 89.98%, and 89.26%, respectively for Oriya. Finally, we evaluate our proposed approach with the benchmark dataset of CoNLL-2003 shared task that yields the overall recall, precision, and F -measure values of 88.72%, 88.64%, and 88.68%, respectively. Results also show that the vote based classifier ensemble identified by the GA-based approach outperforms all the individual classifiers, three conventional baseline ensembles, and some other existing ensemble techniques. In a part of the article, we formulate the problem of feature selection in any classifier under the single objective optimization framework and show that our proposed classifier ensemble attains superior performance to it.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/1967293.1967296

Reference64 articles.

1. Anderson T. W. and Scolve S. 1978. Introduction to the Statistical Analysis of Data. Houghton Mifflin. Anderson T. W. and Scolve S. 1978. Introduction to the Statistical Analysis of Data . Houghton Mifflin.

Cited by 33 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An ensemble method based on weight voting method for improved prediction of slope stability;Natural Hazards;2024-04-15

2. Fusion-based approach for hydrometeorological drought modeling: a regional investigation for Iran;Environmental Science and Pollution Research;2024-03-13

3. Genetic Algorithm Optimized Stacking Approach to Skin Disease Detection;IEEE Access;2024

4. Research on Abnormal Risk Identification of Safe Power Consumption for Active Customers Based on Cluster Analysis;2023 3rd International Conference on Electrical Engineering and Control Science (IC2ECS);2023-12-29

5. Deviation-support based fuzzy ensemble of multi-modal deep learning classifiers for breast cancer prognosis prediction;Scientific Reports;2023-12-03