Recognition of Ancient Tamil Characters from Epigraphical inscriptions using Raspberry Pi based Tesseract OCR

Author:

Magrina M. Merline1

Affiliation:

1. Assistant Professor, Department of ECE, Er. Perumal Manimekalai College of Engineering, Hosur, TamilNadu, , India

Abstract

Optical Character Recognition (OCR) is the process of identification of the printed text using photoelectric devices and computer software. It converts the inscribed text on the stones into machine encoded format. OCR is widely used in machine learning process like cognitive computing, machine translation, text to speech conversion and text mining.OCR is mainly used in the research fields like Character Recognition, Artificial Intelligence and Computer Vision. In this research, the recognition process is done using OCR, the inscribed character is processed using Raspberry Pi device on which it recognizes characters using Artificial Neural Network. This work mainly focuses on the recognition of ancient Tamil characters inscribed on stones to modern Tamil characters belong to 9th and 12th century characters. The input image is subjected to gray scale conversion process and enhanced using adaptive thresholding process. The output image is subjected to thinning process to reduce the pixel size of the image. Then the characters are classified using Artificial Neural Network Architecture and the classified characters are mapped to modern Tamil character using Unicode. The Artificial Neural Network has input layer, hidden layer of 15 neurons and output layer of 1 neuron to classify the characters. The accuracy of the constructed system for the recognition of epigraphical inscriptions is calculated. The above process is carried out in raspbian environment using python process.

Publisher

Technoscience Academy

Subject

General Medicine

Reference22 articles.

1. Anush Goel, Akash Sehrawat, Ankush Patil, Prashant Chougule and Supriya Khatavkar (2018), “Raspberry Pi based reader for blind people”, International Research Journal of Engineering and Technology (IRJET 2018), vol: 5, issue: 6, pp: 1639- 1642.

2. Elumalai .G, J. Sundar Rajan, P. Surya Prakash, V.L. Susruth and P.K. Sudharsanan (2018), “Design and Development of Tessaract– OCR based assistive system to convert captured text into voice output”, International Research Journal of Engineering and Technology (IRJET 2018), vol: 5, issue: 4, pp: 509 – 513.

3. Thiyagarajan, Saravanan Kumar, Praveen Kumar and Sakana (2018), “Implementation of Optical Character Recognition using Raspberry Pi for visually Challenged People”, International Journal of Engineering & Technology (IJET 2018), vol: 7, issue: 3, pp: 65 – 67.

4. Vishwanath Bharadwaja, Ananmy, Sarraf Nikhil, Vineetha (2018), “Implementation of Artificial Neural Network on Raspberry Pi for signal processing applications”, International Conference on Advances in Computing, Communications & Informatics (ICACCI 2018), pp: 1488 – 1491.

5. Beihai Tan, Chao Hu and Zepei Zhang (2017), “Character Recognition based on Corner Detection”, IEEE International Conference on Natural computation, Fuzzy System and Knowledge Discovery (ICNC-FSKD), pg: 503-507.

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Enhancing epigraphy: a deep learning approach to recognize and analyze Tamil ancient inscriptions;Neural Computing and Applications;2024-08-09

2. Ancient tamil digits recognition using convolutional neural network;AIP Conference Proceedings;2024

3. ATCRI-21 Neural Network Based Character Recognition with Image Denoising in Ancient Epigraphs;2023 4th International Conference on Smart Electronics and Communication (ICOSEC);2023-09-20

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3