Sign Language Recognition with Multimodal Sensors and Deep Learning Methods

Author:

Lu Chenghong1ORCID,Kozakai Misaki1,Jing Lei1ORCID

Affiliation:

1. Graduate School of Computer Science and Engineering, University of Aizu, Tsuruga, Ikki-machi, Aizuwakamatsu 965-8580, Japan

Abstract

Sign language recognition is essential in hearing-impaired people’s communication. Wearable data gloves and computer vision are partially complementary solutions. However, sign language recognition using a general monocular camera suffers from occlusion and recognition accuracy issues. In this research, we aim to improve accuracy through data fusion of 2-axis bending sensors and computer vision. We obtain the hand key point information of sign language movements captured by a monocular RGB camera and use key points to calculate hand joint angles. The system achieves higher recognition accuracy by fusing multimodal data of the skeleton, joint angles, and finger curvature. In order to effectively fuse data, we spliced multimodal data and used CNN-BiLSTM to extract effective features for sign language recognition. CNN is a method that can learn spatial information, and BiLSTM can learn time series data. We built a data collection system with bending sensor data gloves and cameras. A dataset was collected that contains 32 Japanese sign language movements of seven people, including 27 static movements and 5 dynamic movements. Each movement is repeated 10 times, totaling about 112 min. In particular, we obtained data containing occlusions. Experimental results show that our system can fuse multimodal information and perform better than using only skeletal information, with the accuracy increasing from 68.34% to 84.13%.

Funder

JSPS KAKENHI

JKA Foundation

NEDO Intensive Support for Young Promising Researchers

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Reference30 articles.

1. World Health Organization (2023, September 18). World Report on Hearing. Available online: https://www.who.int/news-room/fact-sheets/detail/deafness-and-hearing-loss.

2. Machine learning methods for sign language recognition: A critical review and analysis;Adeyanju;Intell. Syst. Appl.,2021

3. Technological Solutions for Sign Language Recognition: A Scoping Review of Research Trends, Challenges, and Opportunities;Joksimoski;IEEE Access,2022

4. Amin, M.S., Rizvi, S.T.H., and Hossain, M.M. (2022). A Comparative Review on Applications of Different Sensors for Sign Language Recognition. J. Imaging, 8.

5. Deep Learning for Sign Language Recognition: Current Techniques, Benchmarks, and Open Issues;Khalid;IEEE Access,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3