Fine-Tuned Pre-Trained Model for Script Recognition-Reference-Cited by-同舟云学术

Fine-Tuned Pre-Trained Model for Script Recognition

Published:2021-10-01 Issue:5 Volume:6 Page:1297-1314
ISSN:2455-7749
Container-title:International Journal of Mathematical, Engineering and Management Sciences
language:en
Short-container-title:Int J Math, Eng, Manag Sci

Author:

Bisht Mamta¹,Gupta Richa¹

Affiliation:

1. Department of Electronics and Communication Engineering, Jaypee Institute of Information Technology, Noida, India.

Abstract

Script recognition is the first necessary preliminary step for text recognition. In the deep learning era, for this task two essential requirements are the availability of a large labeled dataset for training and computational resources to train models. But if we have limitations on these requirements then we need to think of alternative methods. This provides an impetus to explore the field of transfer learning, in which the previously trained model knowledge established in the benchmark dataset can be reused in another smaller dataset for another task, thus saving computational power as it requires to train only less number of parameters from the total parameters in the model. Here we study two pre-trained models and fine-tune them for script classification tasks. Firstly, the VGG-16 pre-trained model is fine-tuned for publically available CVSI-15 and MLe2e datasets for script recognition. Secondly, a well-performed model on Devanagari handwritten characters dataset has been adopted and fine-tuned for the Kaggle Devanagari numeral dataset for numeral recognition. The performance of proposed fine-tune models is related to the nature of the target dataset as similar or dissimilar from the original dataset and it has been analyzed with widely used optimizers.

Publisher

International Journal of Mathematical, Engineering and Management Sciences plus Mangey Ram

Subject

General Engineering,General Business, Management and Accounting,General Mathematics,General Computer Science

Reference29 articles.

1. Alabau, V., Sanchis, A., & Casacuberta, F. (2014). Improving on-line handwritten recognition in interactive machine translation. Pattern Recognition, 47(3), 1217–1228. Doi: 10.1016/j.patcog.2013.09.035.

2. Bhunia, A.K., Konwer, A., Bhunia, A.K., Bhowmick, A., Roy, P.P., & Pal, U. (2019). Script identification in natural scene image and video frames using an attention based convolutional-LSTM network. Pattern Recognition, 85, 172–184. Doi: 10.1016/j.patcog.2018.07.034.

3. Bisht, M., & Gupta, R. (2020). Multiclass recognition of offline handwritten Devanagari characters using CNN. International Journal of Mathematical, Engineering and Management Sciences, 5(6), 1429–1439.

4. Chen, J., Chen, J., Zhang, D., Sun, Y., & Nanehkaran, Y.A. (2020). Using deep transfer learning for image-based plant disease identification. Computers and Electronics in Agriculture, 173, 105393.

5. Ghosh, D., Dube, T., & Shivaprasad, A. (2010). Script recognition—a review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(12), 2142–2161. Doi:10.1109/TPAMI.2010.30.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improve Code Summarization via Prompt-Tuning CodeT5;Wuhan University Journal of Natural Sciences;2023-12

2. Fine-Tuning Pre-Trained CodeBERT for Code Search in Smart Contract;Wuhan University Journal of Natural Sciences;2023-06

3. Handwritten Devanagari Word Detection and Localization using Morphological Image Processing;2023 10th International Conference on Signal Processing and Integrated Networks (SPIN);2023-03-23

4. Diabetic Retinopathy Binary Image Classification Using Pyspark;International Journal of Mathematical, Engineering and Management Sciences;2022-10-01