Affiliation:
1. Institute of Technology, Debre Markos University, Debre Markos P.O. Box 269, Ethiopia
2. Department of Computer Science and Information Engineering, National Taipei University of Technology (Taipei Tech), Taipei City 106, Taiwan
3. Samsung Display Vietnam (SDV), Yen Phong Industrial Park, Bac Ninh 16000, Vietnam
Abstract
Offline handwritten text recognition (HTR) is a long-standing research project for a wide range of applications, including assisting visually impaired users, humans and robot interactions, and the automatic entry of business documents. However, due to variations in writing styles, visual similarities between different characters, overlap between characters, and source document noise, designing an accurate and flexible HTR system is challenging. The problem becomes serious when the algorithm has a low learning capacity and when the text used is complex and has a lot of characters in the writing system, such as Ethiopic script. In this paper, we propose a new model that recognizes offline handwritten Ethiopic text using a gated convolution and stacked self-attention encoder–decoder network. The proposed model has a feature extraction layer, an encoder layer, and a decoder layer. The feature extraction layer extracts high-dimensional invariant feature maps from the input handwritten image. Using the extracted feature maps, the encoder and decoder layers transcribe the corresponding text. For the training and testing of the proposed model, we prepare an offline handwritten Ethiopic text-line dataset (HETD) with 2800 samples and a handwritten Ethiopic word dataset (HEWD) with 10,540 samples obtained from 250 volunteers. The experiment results of the proposed model on HETD show a 9.17 and 13.11 Character Error Rate (CER) and Word Error Rate (WER), respectively. However, the model on HEWD shows an 8.22 and 9.17 CER and WER, respectively. These results and the prepared datasets will be used as a baseline for future research.
Funder
National Science and Technology Council
Reference36 articles.
1. Online and Offline Handwritten Chinese Character Recognition: Benchmarking on New Databases;Liu;Pattern Recognit.,2013
2. Natarajan, P., Saleem, S., Prasad, R., MacRostie, E., and Subramanian, K. (2008). Arabic and Chinese Handwriting Recognition, Springer.
3. Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models;IEEE Trans. Pattern Anal. Mach. Intell.,2011
4. ImageNet Classification with Deep Convolutional Neural Networks;Krizhevsky;Adv. Neural Inf. Process Syst.,2012
5. Zhao, Y., Zhang, X., Fu, B., Zhan, Z., Sun, H., Li, L., and Zhang, G. (2022). Evaluation and Recognition of Handwritten Chinese Characters Based on Similarities. Appl. Sci., 12.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献