Recognition of Arabic Air-Written Letters: Machine Learning, Convolutional Neural Networks, and Optical Character Recognition (OCR) Techniques

Author:

Nahar Khalid M. O. (1), Alsmadi Izzat (2), Al Mamlook Rabia Emhamed (3, 4), Nasayreh Ahmad (1), Gharaibeh Hasan (1), Almuflih Ali Saeed (5), Alasim Fahad (6)

Affiliation:

1. Computer Science Department, Faculty of Information Technology and Computer Sciences, Yarmouk University, Irbid 21163, Jordan

2. Department of Computing and Cyber Security, Texas A&M University-San Antonio, San Antonio, TX 78224, USA

3. Department of Business Administration, Trine University, Angola, IN 49008, USA

4. Department of Mechanical and Industrial Engineering, University of Zawia, Tripoli 16418, Libya

5. Department of Industrial Engineering, College of Engineering, King Khalid University, Abha 62529, Saudi Arabia

6. Department of Industrial Engineering, College of Engineering, King Saud University, Riyadh 11495, Saudi Arabia

Abstract

Air writing is a rapidly growing field that stands to benefit from the metaverse and from easier communication between humans and machines. The research literature on air writing and its applications shows significant work in English and Chinese, while little research has been conducted in other languages, such as Arabic. To fill this gap, we propose a hybrid model that extracts features with deep learning models, classifies them with machine learning (ML) and optical character recognition (OCR) methods, and applies grid and random search optimization algorithms to obtain the best model parameters and outcomes. Several machine learning methods (e.g., neural networks (NNs), random forest (RF), K-nearest neighbours (KNN), and support vector machine (SVM)) are applied to deep features extracted from deep convolutional neural networks (CNNs), such as VGG16, VGG19, and SqueezeNet. Our study uses the AHAWP dataset, which consists of diverse writing styles and hand sign variations, to train and evaluate the models. Preprocessing schemes are applied to improve data quality by reducing bias. Furthermore, OCR methods are integrated into our model to isolate individual letters from continuous air-written gestures and improve recognition results. The proposed model achieved its best accuracy, 88.8%, using an NN with VGG16 features.
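The tuning step described in the abstract (a classical classifier fitted on deep features, with hyperparameters chosen by grid search) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the feature vectors are synthetic stand-ins for CNN embeddings such as those from VGG16, the classifier is a plain KNN, and the helper names (`make_features`, `knn_predict`, `cv_accuracy`, `grid_search`) and the grid values are all assumptions made for the example.

```python
import random
from collections import Counter
from itertools import product


def make_features(n_classes=3, per_class=30, dim=8, seed=0):
    """Synthetic stand-in for CNN deep features: one Gaussian cluster per class."""
    rng = random.Random(seed)
    X, y = [], []
    for label in range(n_classes):
        centre = [rng.uniform(-5, 5) for _ in range(dim)]
        for _ in range(per_class):
            X.append([c + rng.gauss(0, 1.0) for c in centre])
            y.append(label)
    return X, y


def knn_predict(train_X, train_y, x, k, p):
    """Majority vote among the k nearest training vectors (Minkowski-p distance)."""
    dists = sorted(
        (sum(abs(a - b) ** p for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def cv_accuracy(X, y, k, p, folds=5, seed=0):
    """Mean accuracy over a simple shuffled k-fold cross-validation."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    correct = 0
    for f in range(folds):
        test_idx = set(idx[f::folds])
        train_X = [X[i] for i in idx if i not in test_idx]
        train_y = [y[i] for i in idx if i not in test_idx]
        for i in test_idx:
            correct += knn_predict(train_X, train_y, X[i], k, p) == y[i]
    return correct / len(X)


def grid_search(X, y, grid):
    """Try every (k, p) combination and keep the best cross-validated score."""
    best_params, best_score = None, -1.0
    for k, p in product(grid["k"], grid["p"]):
        score = cv_accuracy(X, y, k, p)
        if score > best_score:
            best_params, best_score = {"k": k, "p": p}, score
    return best_params, best_score


X, y = make_features()
best_params, best_score = grid_search(X, y, {"k": [1, 3, 5, 7], "p": [1, 2]})
print(best_params, round(best_score, 3))
```

In a real pipeline the rows of `X` would presumably be embeddings taken from a pretrained CNN, and a random search would sample parameter combinations instead of enumerating them all.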

Funder

The Deanship of Scientific Research, King Khalid University, Kingdom of Saudi Arabia

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Cited by 7 articles.

1. Tifinagh Characters Recognition Using Deep CNN - LSTM;2024 4th International Conference on Emerging Smart Technologies and Applications (eSmarTA);2024-08-06

2. Construction Method and Practical Application of Oil and Gas Field Surface Engineering Case Database Based on Knowledge Graph;Processes;2024-05-25

3. Enhancing Tamil Handwritten Character Recognition Using Multimodel Deep Learning;2024 10th International Conference on Communication and Signal Processing (ICCSP);2024-04-12

4. Enhancing Arabic Handwritten Recognition System-Based CNN-BLSTM Using Generative Adversarial Networks;European Journal of Artificial Intelligence and Machine Learning;2024-04-02

5. Dhad—A Children’s Handwritten Arabic Characters Dataset for Automated Recognition;Applied Sciences;2024-03-10
