Multi-Stroke Thai Finger-Spelling Sign Language Recognition System with Deep Learning

Published: 2021-02-04 Issue: 2 Volume: 13 Page: 262
ISSN: 2073-8994
Container-title: Symmetry
Language: en
Short-container-title: Symmetry

Author:

Pariwat Thongpan, Seresangtakul Pusadee

Abstract

Sign language is used by people with hearing impairments, but most of the general public does not understand it. A sign language recognition system can therefore act as an intermediary between the two groups. In this study, we developed a multi-stroke Thai finger-spelling sign language (TFSL) recognition system based on deep learning as a communication tool. The system uses a vision-based technique that works against a complex background: hands are segmented by semantic segmentation with dilated convolution, hand strokes are separated using optical flow, and feature learning and classification are performed by a convolutional neural network (CNN). We compared five CNN structures, referred to as formats. The first format used 64 filters of size 3 × 3 with 7 layers; the second format used 128 filters of size 3 × 3 with 7 layers; the third format used an ascending number of filters, all of size 3 × 3, with 7 layers; the fourth format used an ascending number of filters and small filter sizes with 7 layers; the final format was a structure based on AlexNet. The average accuracies were 88.83%, 87.97%, 89.91%, 90.43%, and 92.03%, respectively. We used the AlexNet-based structure, which achieved the highest average accuracy, to build the models for the multi-stroke TFSL recognition system. The experiment used isolated videos of 42 Thai letters, divided into three categories: one stroke, two strokes, and three strokes. The system achieved an average accuracy of 88.00% for one stroke, 85.42% for two strokes, and 75.00% for three strokes.
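To give a sense of scale for the first two formats described in the abstract, the following is a minimal sketch that counts the trainable parameters of a stack of 7 convolutional layers with 3 × 3 filters. It rests on assumptions that the abstract does not state: the input is a 3-channel RGB image, every one of the 7 layers is a convolutional layer with a bias term, and pooling or fully connected layers are ignored. The function names `conv_params` and `stack_params` are hypothetical and are not taken from the paper.

```python
def conv_params(in_channels, out_channels, kernel=3):
    """Weights plus biases of one 2-D convolutional layer with square kernels."""
    return kernel * kernel * in_channels * out_channels + out_channels


def stack_params(filters_per_layer, in_channels=3, kernel=3):
    """Total parameters of consecutive conv layers (assumed RGB input)."""
    total = 0
    for out_channels in filters_per_layer:
        total += conv_params(in_channels, out_channels, kernel)
        # The output of this layer feeds the next one.
        in_channels = out_channels
    return total


# Format 1: 64 filters per layer; format 2: 128 filters per layer (7 layers each).
format_1 = stack_params([64] * 7)
format_2 = stack_params([128] * 7)
print("format 1:", format_1)
print("format 2:", format_2)
```

Under these assumptions the second format carries roughly four times as many convolutional parameters as the first, while the abstract reports a slightly lower average accuracy for it (87.97% versus 88.83%).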

Publisher

MDPI AG

Subject

Physics and Astronomy (miscellaneous), General Mathematics, Chemistry (miscellaneous), Computer Science (miscellaneous)

Link

https://www.mdpi.com/2073-8994/13/2/262/pdf

References: 26 articles.

1. Multi-modality American Sign Language recognition

2. Hand Sign Recognition for Thai Finger Spelling: an Application of Convolution Neural Network

3. Thai Finger-Spelling Sign Language Recognition Employing PHOG and Local Features with KNN; Pariwat; Int. J. Adv. Soft Comput. Appl., 2019

4. Thai finger-spelling sign language recognition using global and local features with SVM

5. Thai sign language recognition by using geometric invariant feature and ANN classification

Cited by 16 articles.

1. A neural-network based web application on real-time recognition of Pakistani sign language; Engineering Applications of Artificial Intelligence; 2024-09

2. Deep multimodal-based finger spelling recognition for Thai sign language: a new benchmark and model composition; Machine Vision and Applications; 2024-05-31

3. Comparing the Coordinate-Sequencing for Human's Joints Models on Thai Finger Spelling; 2024 12th International Electrical Engineering Congress (iEECON); 2024-03-06

4. Deep Multimodal-Based Recognition of Thai Finger Spelling with Two-Handed Postures; 2024 12th International Electrical Engineering Congress (iEECON); 2024-03-06

5. Recent Advances on Deep Learning for Sign Language Recognition; Computer Modeling in Engineering & Sciences; 2024