Balinese character recognition on mobile application based on tesseract open source OCR engine
-
Published:2020-04-01
Issue:1
Volume:1516
Page:012017
-
ISSN:1742-6588
-
Container-title:Journal of Physics: Conference Series
-
language:
-
Short-container-title:J. Phys.: Conf. Ser.
Author:
Mudiarta I M D R,Atmaja I M D S,Suharsana I K,Antara I W G S,Bharaditya I W P,Suandirat G A,Indrawan G
Abstract
Balinese script, a part of Balinese culture, is rarely used today. Through Governor Regulation Number 80 of 2018, the Provincial Government of Bali is trying to preserve the Balinese language and script. This study aimed to help preserve the Balinese script through mobile technology, a recent trend with worldwide coverage that supports ubiquitous learning. The research integrated the Tesseract open source Optical Character Recognition (OCR) engine into an Android application that converts images of Balinese characters into text. The input is a Balinese script image, either captured by the mobile camera or taken from an existing Balinese script image. Using the training data available in the application, the application converts the input image into text that can be processed further. The new Balinese script training data was created from only the eighteen basic syllables of the Balinese script and its numbers. The application can operate offline on mobile hardware that supports camera functions. In a test on 50 words, a recognition rate of 62% was obtained for good-quality images set in the Bali-Simbar font. The application can be developed further to recognize other character repertoires, i.e., vowels (Akśara Suara), semi vowels (Arda Suara), additional syllables (Akśara Şwalalita), and sound killers (Pangangge Tengenan).
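As a rough illustration of the reported figure, the sketch below computes a word-level recognition rate, assuming the 62% refers to the share of the 50 test words recognized exactly (which would correspond to 31 words). The function name `word_recognition_rate` and the placeholder word lists are hypothetical and are not taken from the paper.

```python
def word_recognition_rate(expected, recognized):
    """Return the fraction of words recognized exactly, compared position by position."""
    if len(expected) != len(recognized):
        raise ValueError("expected and recognized must have the same length")
    if not expected:
        return 0.0
    correct = sum(1 for e, r in zip(expected, recognized) if e == r)
    return correct / len(expected)


# Hypothetical placeholder data (not the paper's test set):
# 50 test words, of which 31 are assumed to be recognized correctly.
expected = [f"word{i}" for i in range(50)]
recognized = expected[:31] + ["?"] * 19

rate = word_recognition_rate(expected, recognized)
print(f"{rate:.0%}")  # prints 62%
```

An exact-match comparison is the strictest choice here; a character-level metric would give partial credit for near misses, and the abstract does not say which was used.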
Subject
General Physics and Astronomy
Reference: 9 articles.
1. Latin-to-Balinese script transliteration method on mobile application: A comparison;Indrawan;Indones. J. Electr. Eng. Comput. Sci.,2018
2. A new method of Latin-to-Balinese script transliteration based on Noto Sans Balinese font and dictionary data structure;Indrawan,2019
3. Pengenalan Aksara Bali Dengan Metode Local Binary Pattern;Sari;eProceedings Eng.,2015
4. Tesseract OCR Engine;Smith,2007
Cited by
2 articles.