Affiliation:
1. National Institute of Development Administration
Abstract
This paper presents a Thai character recognition method based on topological properties. The method first extracts gradient features from a character image. A two-step classification are then applied to recognize the character. In the first step, a conditional random fields model is used to generate a set of possible characters. Then a nearest neighbor model based on hierarchical centroid distance is employed to finally recognize the character. The proposed method is trained by printed characters from documents and vehicle license plates. The technique is evaluated and found to have the recognition rate of 96.96%.
Publisher
Trans Tech Publications, Ltd.
Reference19 articles.
1. B. B. Chaudhuri and U. Garain. Extraction of Type Style-based Meta-information from Imaged Documents, Int. J. on Document Analysis and Recognition, (2001).
2. C. Sun, D. Si, Skew, and Slant. Correction for Document Images Using Gradient Direction. Proc. Int. Conf. On Document Analysis and Recognition (ICDAR), Vol. 1, 1997, pp.142-146.
3. L. Zhang, Y. Lu and C. L. Tan. Italic Font Recognition Using Stroke Pattern Analysis on Wavelet Decomposed Word Images. Proceedings of the 17th International Conference on Pattern Recognition, Vol. 4, 2004, pp.835-838.
4. F. Kimura and M. Shridhar. Handwritten numerical recognition based on multiple algorithms, Pattern Recognition, Vol. 24, 1991, pp.969-983.
5. S. Srisuk. Thai printed character recognition using the Hausdorff distance. Proceedings of National Computer Science and Engineering Conference (NCSEC), (1999).