Selection of an Optimal Set of Features for Bengali Character Recognition-Reference-Cited by-同舟云学术

Selection of an Optimal Set of Features for Bengali Character Recognition

Published:2013 Issue: Volume: Page:96-116
ISSN:
Container-title:Technical Challenges and Design Issues in Bangla Language Processing
language:
Short-container-title:

Author:

Sarwar Hasan¹,Rahman Mizanur²,Akter Nasreen³,Hossain Saima⁴,Ahmed Sabrina⁵,Rahman Chowdhury Mofizur¹

Affiliation:

1. United International University, Bangladesh

2. Institute of Science and Technology (IST), Bangladesh

3. St. Francis Xavier University, Canada

4. LEADS Corporation Limited, Bangladesh

5. Local Government Engineering Department (LGED), Bangladesh

Abstract

Feature extraction is an essential step of Optical Character Recognition. Accurate and distinguishable feature plays a significant role to leverage the performance of a classifier. The complexity level of feature identification algorithm differs for alphabet sets of different languages. Apart from generic algorithms to find features of different alphabet sets, these algorithms take care of individual characteristic common for a particular alphabet set. Dominant features of one alphabet set might completely differ from that of another set. Since there always remains the chance that inaccurate features may cause inefficient recognition, special attention should be given to identify the set of optimal features of a character set. Bengali characters also have some specific issues apart from the existing issues of other character sets. For example, there are about 300 basic, modified, and compound character shapes in the script, the characters in a word are topologically connected, and Bengali is an inflectional language. Literature survey shows that several authors have used different features and classification algorithms. The authors have extensively reviewed all these feature sets. In order to identify an optimal feature set, variability analysis has been proposed here. They focus on the specific peculiarities of Bengali alphabet sets, its different usage as vowel and consonant signs, compound, complex, and touching characters. The authors also took care to generate easily computable features that take less time for generation. However, more attention needs to be given in order to choose an efficient classifier.

Publisher

IGI Global

Reference31 articles.

1. Abdullah, A. B. M., & Rahman, A. (2003). A survey on script segmentation for Bangla OCR: An implementation perspective. In Proceedings of 6th International Conference on Computer and Information Technology (ICCIT), (pp. 856-860). ICCIT.

2. Akter, N., Hossain, S., Islam, M. T., & Sarwar, H. (2008). An algorithm for segmenting modifiers from Bangla text. In Proceedings of 11th International Conference on Computer and Information Technology (ICCIT), (pp. 177 - 182). Khulna, Bangladesh: ICCIT.

3. Alam, M. M., & Anwer, M. (2005). Feature subset selection using genetic algorithm for Bengali handwritten digit recognition. In Proceedings of the National Conference on Computer Processing of Bangla, (pp. 258-263). Independent University.

4. A complete Bangla OCR system for printed characters.;M. M.Alam;Journal of Computer and Information Technology,2010

5. Design of a view based approach for Bengali character recognition.;S.Barman;International Journal of Advanced Science and Technology,2010

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Anatomy of Bengali Letterforms: A Semiotic Study;ICoRD’15 – Research into Design Across Boundaries Volume 1;2014-12-24