Abstract
Sign language recognition is challenging because hand gestures are intricate and recognition depends on capturing fine-grained detail. To address these challenges, this work proposes the Lightweight Attentive VGG16 with Random Forest (LAVRF) model. LAVRF combines a streamlined adaptation of the VGG16 model, augmented with attention modules, with a Random Forest classifier. Streamlining the VGG16 architecture reduces model complexity, while the attention mechanisms dynamically focus on the pertinent regions of each input image, improving representation learning. The Random Forest classifier handles high-dimensional feature representations well, reduces variance and overfitting, and is resilient to noisy and incomplete data. Hyperparameters are tuned using Optuna in conjunction with hill climbing, which efficiently explores the hyperparameter space in search of optimal configurations. The proposed LAVRF model achieves accuracies of 99.98%, 99.90%, and 100% on the American Sign Language, American Sign Language with Digits, and NUS Hand Posture datasets, respectively.
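The hill-climbing component of the hyperparameter search can be illustrated with a minimal sketch. It covers only the greedy local-search idea and does not call Optuna. The search space, the parameter names (`n_estimators`, `max_depth`), and the function `toy_validation_score` are hypothetical stand-ins, not values or code from the paper. In a real run the score would come from training the Random Forest on the extracted features and measuring held-out accuracy.

```python
import random

# Hypothetical search space for a Random Forest classifier; these names and
# ranges are illustrative assumptions, not values reported by the paper.
SEARCH_SPACE = {
    "n_estimators": [50, 100, 200, 400, 800],
    "max_depth": [4, 8, 16, 32],
}


def toy_validation_score(params):
    """Stand-in for validation accuracy (assumption: a real run would train
    the classifier and score it on held-out data instead)."""
    # Single-peaked surrogate whose maximum lies at (400, 16).
    return (
        1.0
        - 0.0000001 * (params["n_estimators"] - 400) ** 2
        - 0.0005 * (params["max_depth"] - 16) ** 2
    )


def neighbours(params):
    """Yield configurations that differ from `params` by one grid step."""
    for name, values in SEARCH_SPACE.items():
        i = values.index(params[name])
        for j in (i - 1, i + 1):
            if 0 <= j < len(values):
                yield {**params, name: values[j]}


def hill_climb(score_fn, start, max_steps=100):
    """Greedy hill climbing: move to the best neighbour until none improves."""
    current, current_score = start, score_fn(start)
    for _ in range(max_steps):
        best = max(neighbours(current), key=score_fn)
        best_score = score_fn(best)
        if best_score <= current_score:
            break  # no neighbour is better: local optimum reached
        current, current_score = best, best_score
    return current, current_score


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the random start is repeatable
    start = {name: rng.choice(values) for name, values in SEARCH_SPACE.items()}
    best_params, best_score = hill_climb(toy_validation_score, start)
    print(best_params, best_score)
```

Because hill climbing only ever moves to a better neighbour, it can stall at a local optimum on a rugged objective. One plausible reason to pair it with a sampler-based tool such as Optuna is to supply good starting points for this local refinement, though the abstract does not specify how the two are combined.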
Funder
Telekom Malaysia Berhad
Deanship of Scientific Research, King Khalid University
Publisher
Public Library of Science (PLoS)