Efficient YOLO-Based Deep Learning Model for Arabic Sign Language Recognition-Reference-Cited by-同舟云学术

Efficient YOLO-Based Deep Learning Model for Arabic Sign Language Recognition

Published:2024-05-07 Issue:4 Volume:3 Page:
ISSN:1658-9912
Container-title:Journal of Disability Research
language:en
Short-container-title:

Author:

Al Ahmadi Saad¹,Mohammad Farah¹^ORCID,Al Dawsari Haya¹

Affiliation:

1. Department of Computer Science, College of Computer and Information Sciences, King Saud University, Riyadh 11543, Saudi Arabia

Abstract

Verbal communication is the dominant form of self-expression and interpersonal communication. Speech is a considerable obstacle for individuals with disabilities, including those who are deaf, hard of hearing, mute, and nonverbal. Sign language is a complex system of gestures and visual signs facilitating individual communication. With the help of artificial intelligence, the hearing and the deaf can communicate more easily. Automatic detection and recognition of sign language is a complex and challenging task in computer vision and machine learning. This paper proposes a novel technique using deep learning to recognize the Arabic Sign Language (ArSL) accurately. The proposed method relies on advanced attention mechanisms and convolutional neural network architecture integrated with a robust You Only Look Once (YOLO) object detection model that improves the detection and recognition rate of the proposed technique. In our proposed method, we integrate the self-attention block, channel attention module, spatial attention module, and cross-convolution module into feature processing for accurate detection. The recognition accuracy of our method is significantly improved, with a higher detection rate of 99%. The methodology outperformed conventional methods, achieving a precision rate of 0.9 and a mean average precision (mAP) of 0.9909 at an intersection over union (IoU) of 0.5. From IoU thresholds of 0.5 to 0.95, the mAP continuously remains high, indicating its effectiveness in accurately identifying signs at different precision levels. The results show the model’s robustness in accurately detecting and classifying complex multiple ArSL signs. The results show the robustness and efficacy of the proposed model.

Publisher

King Salman Center for Disability Research

Link

https://scienceopen.com/document_file/963ce385-fd62-4cf3-bf47-198cad719531/ScienceOpen/jdr20240051.pdf

Reference43 articles.

1. A survey on sign language literature;M Alaghband;Mach. Learn. Appl,2023

2. Arabic Sign Language recognition using convolutional neural network and mobilenet;E Aldhahri;Arab. J. Sci. Eng,2023

3. Recognition of gestures in Arabic Sign Language using neuro-fuzzy systems;O Al-Jarrah;Artif. Intell,2001

4. Arabic Sign Language letters recognition using vision transformer;AF Alnabih;Multimed. Tools Appl,2024

5. DeepArSLR: a novel signer-independent deep learning framework for isolated Arabic Sign Language gestures recognition;S Aly;IEEE Access,2020

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Real-time Arabic avatar for deaf-mute communication enabled by deep learning sign language translation;Computers and Electrical Engineering;2024-10