Sign Language Recognition with Multimodal Sensors and Deep Learning Methods

Published:2023-11-29 Issue:23 Volume:12 Page:4827
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Lu Chenghong¹, Kozakai Misaki¹, Jing Lei¹

Affiliation:

1. Graduate School of Computer Science and Engineering, University of Aizu, Tsuruga, Ikki-machi, Aizuwakamatsu 965-8580, Japan

Abstract

Sign language recognition is essential for hearing-impaired people's communication. Wearable data gloves and computer vision are partially complementary solutions. However, sign language recognition using a general monocular camera suffers from occlusion and limited recognition accuracy. In this research, we aim to improve accuracy through data fusion of 2-axis bending sensors and computer vision. We obtain hand key points from sign language movements captured by a monocular RGB camera and use these key points to calculate hand joint angles. The system achieves higher recognition accuracy by fusing multimodal data: the skeleton, joint angles, and finger curvature. To fuse the data effectively, we concatenate the multimodal data and use a CNN-BiLSTM to extract features for sign language recognition: the CNN learns spatial information, and the BiLSTM learns temporal patterns in the time series. We built a data collection system with bending-sensor data gloves and cameras, and collected a dataset of 32 Japanese sign language movements performed by seven people, comprising 27 static movements and 5 dynamic movements. Each movement was repeated 10 times, for a total of about 112 min of data. In particular, the dataset includes data containing occlusions. Experimental results show that our system can fuse multimodal information and performs better than using skeletal information alone, with accuracy increasing from 68.34% to 84.13%.
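The joint-angle and fusion steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `joint_angle` and `fuse_frame`, the use of 3D key points, the angle definition (the angle at a middle key point between its two neighbours), and the sample values are all assumptions made for illustration, since the abstract does not specify them.

```python
import math


def joint_angle(a, b, c):
    """Angle in degrees at key point b, between segments b->a and b->c.

    Hypothetical helper: the abstract does not state how the paper
    defines its joint angles.
    """
    v1 = [a[i] - b[i] for i in range(3)]
    v2 = [c[i] - b[i] for i in range(3)]
    dot = sum(x * y for x, y in zip(v1, v2))
    n1 = math.sqrt(sum(x * x for x in v1))
    n2 = math.sqrt(sum(x * x for x in v2))
    if n1 == 0.0 or n2 == 0.0:
        return 0.0  # degenerate case: two key points coincide
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_theta = max(-1.0, min(1.0, dot / (n1 * n2)))
    return math.degrees(math.acos(cos_theta))


def fuse_frame(skeleton, angles, bend):
    """Concatenate the three modalities into one per-frame feature vector.

    The ordering (skeleton, angles, bend) is an assumed layout.
    """
    flat_skeleton = [coord for point in skeleton for coord in point]
    return flat_skeleton + list(angles) + list(bend)


# Illustrative values only: three key points along one finger.
skeleton = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0)]
angles = [joint_angle(*skeleton)]
bend = [0.3, 0.7]  # made-up glove readings for the two bending axes
frame = fuse_frame(skeleton, angles, bend)
```

A sequence of such per-frame vectors would then form the time series passed to a sequence model such as the CNN-BiLSTM named in the abstract.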

Funder

JSPS KAKENHI

JKA Foundation

NEDO Intensive Support for Young Promising Researchers

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/23/4827/pdf
