EI-RNN-based text generation for the static and dynamic isolated sign language videos-Reference-Cited by-同舟云学术

EI-RNN-based text generation for the static and dynamic isolated sign language videos

Published:2023-10-26 Issue: Volume: Page:1-15
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Subburaj S.¹,Murugavalli S.²,Muthusenthil B.³

Affiliation:

1. Department of Computer Science and Engineering, Anna University, Chennai, India

2. Department of Computer Science and Engineering, Panimalar Engineering College Chennai City Campus, Chennai, India

3. Department of Computer Science and Engineering, SRM Valliammai Engineering College, SRM Nagar, Kattankulathur, India

Abstract

SLR, which assists hearing-impaired people to communicate with other persons by sign language, is considered as a promising method. However, as the features of some of the static SL could be the same as the feature in a single frame of dynamic Isolated Sign Language (ISL), the generation of accurate text corresponding to the SL is necessary during the SLR. Therefore, Edge-directed Interpolation-based Recurrent Neural Network (EI-RNN)-centered text generation with varied features of the static and dynamic Isolated SL is proposed in this article. Primarily, ISL videos are converted to frames and pre-processed with key frame extraction and illumination control. After that, the foreground is separated with the Symmetric Normalised Laplacian-centered Otsu Thresholding (SLOT) technique for finding accurate key points in the human pose. The human pose’s key points are extracted with the Media Pipeline Holistic (MPH) pipeline approach and to improve the features of the face and hand sign, the resultant frame is fused with the depth image. After that, to differentiate the static and dynamic actions, the action change in the fused frames is determined with a correlation matrix. After that, to engender the output text for the respective SL, features are extracted individually as of the static and dynamic frames. It is obtained from the analysis that when analogized to the prevailing models, the proposed EI-RNN’s translation accuracy is elevated by 2.05% in INCLUDE 50 Indian SL based Dataset and Top 1 Accuracy 2.44% and Top 10 accuracy, 1.71% improved in WLASL 100 American SL.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference22 articles.

1. A Comprehensive Study on Deep Learning-Based Methods for Sign Language Recognition;Adaloglou;IEEE Transactions on Multimedia,2022

2. Two Dimensional Convolutional Neural Network Approach for Real-Time Bangla Sign Language Characters Recognition and Translation;Alam;SN Computer Science,2021

3. HelpingHearing-Impaired in Emergency Situations: A Deep Learning-BasedApproach;Areeb;IEEE Access,2022

4. Character-level arabic text generation from sign languagevideo using encoder–decoder model;Boukdir;Displays

5. Thai Sign LanguageRecognition: An Application of Deep Neural Network;Chaikaew;2021 Joint6th International Conference on Digital Arts, Media and Technologywith 4th ECTI Northern Section Conference on Electrical,Electronics, Computer and Telecommunication Engineering, ECTI DAMTand NCON,2021