Classification of Printed Gujarati Characters Using Low-Level Stroke Features-Reference-Cited by-同舟云学术

Classification of Printed Gujarati Characters Using Low-Level Stroke Features

Published:2016-06-02 Issue:4 Volume:15 Page:1-26
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Goswami Mukesh M.¹^ORCID,Mitra Suman K.²

Affiliation:

1. Dharmsinh Desai University, Nadiad, Gujarat (India)

2. Dhirubhai Ambani Institute of Information and Communication Technology, Gandhinagar, Gujarat (India)

Abstract

This article presents an elegant technique for extracting the low-level stroke features, such as endpoints, junction points, line elements, and curve elements, from offline printed text using a template matching approach. The proposed features are used to classify a subset of characters from Gujarati script. The database consists of approximately 16,782 samples of 42 middle-zone symbols from the Gujarati character set collected from three different sources: machine printed books, newspapers, and laser printed documents. The purpose of this division is to add variety in terms of size, font type, style, ink variation, and boundary deformation. The experiments are performed on the database using a k-nearest neighbor (kNN) classifier and results are compared with other widely used structural features, namely Chain Codes (CC), Directional Element Features (DEF), and Histogram of Oriented Gradients (HoG). The results show that the features are quite robust against the variations and give comparable performance with other existing works.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2856105

Reference45 articles.

1. Government of India. 2001. Abstract of Speakers. Strength of Languages and Mother Tongues-2000. Census of India. (2001). Government of India. 2001. Abstract of Speakers. Strength of Languages and Mother Tongues-2000. Census of India. (2001).

2. K. Aparna and A. G. Ramakrishnan. 2002. A complete Tamil optical character recognition system. In Document Analysis Systems V Daniel Lopresti Jianying Hu and Ramanujan Kashi (Eds.). Springer Berlin 53--57. K. Aparna and A. G. Ramakrishnan. 2002. A complete Tamil optical character recognition system. In Document Analysis Systems V Daniel Lopresti Jianying Hu and Ramanujan Kashi (Eds.). Springer Berlin 53--57.

3. Shape matching and object recognition using shape contexts

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Handwritten Text Recognition for Regional Languages of Indian Subcontinent;Algorithms for Intelligent Systems;2023

2. Handwritten Gujarati Numeral Recognition using Deep Learning;2022 2nd International Conference on Innovative Sustainable Computational Technologies (CISCT);2022-12-23

3. A Review on Optical Character Recognition of Gujarati Scripts;Proceedings of the 6th International Conference on Advance Computing and Intelligent Engineering;2022-09-22

4. Template-Based Thinning Method for Handwritten Gujarati Character’s Strokes and its Classification for Writer-Dependent Gujarati Font Synthesis;Lecture Notes in Electrical Engineering;2022

5. Handwritten Numeral Recognition Using Polar Histogram of Low-Level Stroke Features;Proceedings of 3rd International Conference on Computer Vision and Image Processing;2019-11-01