EI-RNN-based text generation for the static and dynamic isolated sign language videos

Author:

Subburaj S.1,Murugavalli S.2,Muthusenthil B.3

Affiliation:

1. Department of Computer Science and Engineering, Anna University, Chennai, India

2. Department of Computer Science and Engineering, Panimalar Engineering College Chennai City Campus, Chennai, India

3. Department of Computer Science and Engineering, SRM Valliammai Engineering College, SRM Nagar, Kattankulathur, India

Abstract

SLR, which assists hearing-impaired people to communicate with other persons by sign language, is considered as a promising method. However, as the features of some of the static SL could be the same as the feature in a single frame of dynamic Isolated Sign Language (ISL), the generation of accurate text corresponding to the SL is necessary during the SLR. Therefore, Edge-directed Interpolation-based Recurrent Neural Network (EI-RNN)-centered text generation with varied features of the static and dynamic Isolated SL is proposed in this article. Primarily, ISL videos are converted to frames and pre-processed with key frame extraction and illumination control. After that, the foreground is separated with the Symmetric Normalised Laplacian-centered Otsu Thresholding (SLOT) technique for finding accurate key points in the human pose. The human pose’s key points are extracted with the Media Pipeline Holistic (MPH) pipeline approach and to improve the features of the face and hand sign, the resultant frame is fused with the depth image. After that, to differentiate the static and dynamic actions, the action change in the fused frames is determined with a correlation matrix. After that, to engender the output text for the respective SL, features are extracted individually as of the static and dynamic frames. It is obtained from the analysis that when analogized to the prevailing models, the proposed EI-RNN’s translation accuracy is elevated by 2.05% in INCLUDE 50 Indian SL based Dataset and Top 1 Accuracy 2.44% and Top 10 accuracy, 1.71% improved in WLASL 100 American SL.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference22 articles.

1. A Comprehensive Study on Deep Learning-Based Methods for Sign Language Recognition;Adaloglou;IEEE Transactions on Multimedia,2022

2. Two Dimensional Convolutional Neural Network Approach for Real-Time Bangla Sign Language Characters Recognition and Translation;Alam;SN Computer Science,2021

3. HelpingHearing-Impaired in Emergency Situations: A Deep Learning-BasedApproach;Areeb;IEEE Access,2022

4. Character-level arabic text generation from sign languagevideo using encoder–decoder model;Boukdir;Displays

5. Thai Sign LanguageRecognition: An Application of Deep Neural Network;Chaikaew;2021 Joint6th International Conference on Digital Arts, Media and Technologywith 4th ECTI Northern Section Conference on Electrical,Electronics, Computer and Telecommunication Engineering, ECTI DAMTand NCON,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3