2D Camera-Based Air-Writing Recognition Using Hand Pose Estimation and Hybrid Deep Learning Model-Reference-Cited by-同舟云学术

2D Camera-Based Air-Writing Recognition Using Hand Pose Estimation and Hybrid Deep Learning Model

Published:2023-02-16 Issue:4 Volume:12 Page:995
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Watanabe Taiki¹,Maniruzzaman Md.¹^ORCID,Hasan Md. Al Mehedi²,Lee Hyoun-Sup³,Jang Si-Woong⁴,Shin Jungpil¹^ORCID

Affiliation:

1. School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu 965-8580, Fukushima, Japan

2. Department of Computer Science & Engineering, Rajshahi University of Engineering and Technology, Rajshahi 6204, Bangladesh

3. Department of Applied Software Engineering, Dongeui University, Busanjin-Gu, Busan 47340, Republic of Korea

4. Department of Computer Engineering, Dongeui University, Busanjin-Gu, Busan 47340, Republic of Korea

Abstract

Air-writing is a modern human–computer interaction technology that allows participants to write words or letters with finger or hand movements in free space in a simple and intuitive manner. Air-writing recognition is a particular case of gesture recognition in which gestures can be matched to write characters and digits in the air. Air-written characters show extensive variations depending on the various writing styles of participants and their speed of articulation, which presents quite a difficult task for effective character recognition. In order to solve these difficulties, this current work proposes an air-writing system using a web camera. The proposed system consists of two parts: alphabetic recognition and digit recognition. In order to assess our proposed system, two character datasets were used: an alphabetic dataset and a numeric dataset. We collected samples from 17 participants and asked each participant to write alphabetic characters (A to Z) and numeric digits (0 to 9) about 5–10 times. At the same time, we recorded the position of the fingertips using MediaPipe. As a result, we collected 3166 samples for the alphabetic dataset and 1212 samples for the digit dataset. First, we preprocessed the dataset and then created two datasets: image data and padding sequential data. The image data were fed into the convolution neural networks (CNN) model, whereas the sequential data were fed into bidirectional long short-term memory (BiLSTM). After that, we combined these two models and trained again with 5-fold cross-validation in order to increase the character recognition accuracy. In this work, this combined model is referred to as a hybrid deep learning model. Finally, the experimental results showed that our proposed system achieved an alphabet recognition accuracy of 99.3% and a digit recognition accuracy of 99.5%. We also validated our proposed system using another publicly available 6DMG dataset. Our proposed system provided better recognition accuracy compared to the existing system.

Funder

MSIT (Ministry of Science and ICT), Korea

Competitive Research Fund of The University of Aizu, Japan

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/4/995/pdf

Reference25 articles.

1. Airwriting: Bringing text entry to wearable computers;Amma;XRDS Crossroads, ACM Mag. Stud.,2013

2. Air-writing recognition using smart-bands;Yanay;Pervasive Mob. Comput.,2020

3. Vision based hand gesture recognition;Garg;Int. J. Comput. Inf. Eng.,2009

4. Air-writing recognition—Part I: Modeling and recognition of characters, words, and connecting motions;Chen;IEEE Trans. Hum.-Mach. Syst.,2015

5. Alam, M.S., Kwon, K.C., Alam, M.A., Abbass, M.Y., Imtiaz, S.M., and Kim, N. (2020). Trajectory-based air-writing recognition using deep neural network and depth sensor. Sensors, 20.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Augmented Reality Application for Improving Writing and Motoric Skills in Children With Disabilities;2024 47th MIPRO ICT and Electronics Convention (MIPRO);2024-05-20

2. Vision-Based Hand Rotation Recognition Technique with Ground-Truth Dataset;Applied Sciences;2024-01-03

3. A TinyDL Model for Gesture-Based Air Handwriting Arabic Numbers and Simple Arabic Letters Recognition;IEEE Access;2024

4. A Real-Time Hand Gesture Recognition Based on Media-Pipe and Support Vector Machine;Lecture Notes on Data Engineering and Communications Technologies;2024

5. An online multilingual numeral dataset on Devnagari and English languages for pattern recognition research;Data in Brief;2023-12