Affiliation:
1. Department of Computer Science and Engineering, Jadavpur University, 188 Raja S.C. Mullick Road, Kolkata-700032, West Bengal, India
Abstract
Abstract
In a multilingual country like India, script recognition is an important pre-processing footstep necessary for feeding any document to an optical character recognition (OCR) engine, which is, in general, script specific. The present work evaluates the performance of an ensemble of two MLP (multi-layer perceptron) classifiers, each trained on different feature sets. Here, two complementary sets of features, namely, gray-level co-occurrence matrix (GLCM) and Gabor wavelets transform coefficients are extracted from each of the handwritten text-line and word images written in 12 official scripts used in Indian subcontinent, which are then fed into an individual classifier. In order to improve the overall recognition rate, a powerful combination approach based on the Dempster–Shafer (DS) theory is finally employed to fuse the decisions of two MLP classifiers. The performance of the combined decision is compared with those of the individual classifiers, and it is noted that a significant improvement in recognition accuracy (about 4% for text-line data and 6% for word level data) has been achieved by the proposed methodology.
Subject
Artificial Intelligence,Information Systems,Software
Reference90 articles.
1. Identification of scripts of Indian languages by combining trainable classifiers,2000
2. Page-level script identification from multi-script handwritten documents,2015
3. Review of classifier combination methods,2008
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献