Active convolutional neural networks sign language (ActiveCNN-SL) framework: a paradigm shift in deaf-mute communication

Published:2024-06-01 Issue:6 Volume:57
ISSN:1573-7462
Container-title:Artificial Intelligence Review
language:en
Short-container-title:Artif Intell Rev

Author:

ZainEldin Hanaa, Baghdadi Nadiah A., Gamel Samah A., Aljohani Mansourah, Talaat Fatma M., Malki Amer, Badawy Mahmoud, Elhosseini Mostafa

Abstract

Real-time speech-to-text and text-to-speech technologies have significantly influenced the accessibility of communication for individuals who are deaf or mute. This research aims to assess the efficacy of these technologies in facilitating communication between deaf or mute individuals and those who are neither deaf nor mute. A mixed-method approach will incorporate qualitative and quantitative data collection and analysis techniques. The study will involve participants from deaf or mute and non-deaf or non-mute communities. The research will scrutinize the precision and efficiency of communication using these technologies and evaluate user experience and satisfaction. Furthermore, the study intends to pinpoint potential obstacles and limitations of these technologies and offer suggestions for enhancing their effectiveness in fostering inclusivity. The study proposes an active learning framework for sign language gesture recognition, termed Active Convolutional Neural Networks Sign Language (ActiveCNN-SL). ActiveCNN-SL aims to minimize the labeled data required for training and to improve the accuracy of sign language gesture recognition through iterative human feedback. The proposed framework has the potential to enhance communication accessibility for deaf and mute individuals and to encourage inclusivity across various environments. It is trained using two primary datasets: (i) the Sign Language Gesture Images Dataset and (ii) the American Sign Language Letters (ASL) v1. The framework trains ResNet50 and YOLOv8 models on these datasets and has demonstrated high performance in terms of precision and accuracy. The ResNet model achieved an accuracy of 99.98% during training and a validation accuracy of 100%, surpassing the baseline CNN and RNN models. The YOLOv8 model outperformed previous methods on the ASL alphabet dataset, achieving an overall mean average accuracy for all classes of 97.8%.
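The abstract describes reducing labeling effort through iterative human feedback but does not state how samples are chosen for annotation. As a rough illustration only, the sketch below assumes entropy-based uncertainty sampling, a common active learning query strategy; it is not taken from the paper. The function names, the `budget` parameter, and the probability values are all hypothetical.

```python
import math
from typing import List, Sequence


def predictive_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def select_for_labeling(
    pool_probs: Sequence[Sequence[float]], budget: int
) -> List[int]:
    """Return indices of the `budget` most uncertain unlabeled samples.

    Assumption: uncertainty is measured by predictive entropy. The paper's
    actual query strategy is not given in the abstract.
    """
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: predictive_entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:budget]


# Hypothetical softmax outputs of a gesture classifier over three classes
# for four unlabeled images (illustrative values, not real results).
pool_probs = [
    [0.98, 0.01, 0.01],  # confident prediction, low entropy
    [0.34, 0.33, 0.33],  # nearly uniform, highest entropy
    [0.70, 0.20, 0.10],
    [0.50, 0.49, 0.01],
]

# Indices of the two images a human annotator would be asked to label next.
queried = select_for_labeling(pool_probs, budget=2)
print(queried)
```

In a full loop under these assumptions, the queried images would be labeled by a person, added to the training set, and the model retrained before the next round of selection.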

Funder

King Salman Center for Disability Research

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10462-024-10792-5.pdf

References (45 articles)

1. Alawwad RA, Bchir O, Ismail MMB (2021) Arabic sign language recognition using Faster RCNN. Int J Adv Comput Sci Appl (IJACSA) 12(3):692–700

2. Avola D et al (2018) Exploiting recurrent neural networks and leap motion controller for the recognition of sign language and semaphoric hand gestures. IEEE Trans Multimed 21(1):234–245

3. Barbhuiya AA, Karsh RK, Jain R (2021) CNN based feature extraction and classification for sign language. Multimed Tools Appl 80(2):3051–3069

4. Barbhuiya AA, Karsh RK, Jain R (2022) Gesture recognition from RGB images using convolutional neural network-attention based system. Concurr Comput: Pract Exp 34(24):e7230

5. Bilal A et al (2021a) Neuro-optimized numerical treatment of HIV infection model. Int J Biomath 14(05):2150033