Compact Spatial Pyramid Pooling Deep Convolutional Neural Network Based Hand Gestures Decoder-Reference-Cited by-同舟云学术

Compact Spatial Pyramid Pooling Deep Convolutional Neural Network Based Hand Gestures Decoder

Published:2020-11-07 Issue:21 Volume:10 Page:7898
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Ashiquzzaman Akm,Lee Hyunmin,Kim Kwangki,Kim Hye-Young,Park Jaehyung,Kim Jinsul

Abstract

Current deep learning convolutional neural network (DCNN) -based hand gesture detectors with acute precision demand incredibly high-performance computing power. Although DCNN-based detectors are capable of accurate classification, the sheer computing power needed for this form of classification makes it very difficult to run with lower computational power in remote environments. Moreover, classical DCNN architectures have a fixed number of input dimensions, which forces preprocessing, thus making it impractical for real-world applications. In this research, a practical DCNN with an optimized architecture is proposed with DCNN filter/node pruning, and spatial pyramid pooling (SPP) is introduced in order to make the model input dimension-invariant. This compact SPP-DCNN module uses 65% fewer parameters than traditional classifiers and operates almost 3× faster than classical models. Moreover, the new improved proposed algorithm, which decodes gestures or sign language finger-spelling from videos, gave a benchmark highest accuracy with the fastest processing speed. This proposed method paves the way for various practical and applied hand gesture input-based human-computer interaction (HCI) applications.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/21/7898/pdf

Reference43 articles.

1. From mouth to hand: Gesture, speech, and the evolution of right-handedness

2. Encyclopedia of Database Systems;Liu,2009

3. Optimal Brain Damage;LeCun,1990

4. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Utilizing the Yolov8 Model for Accurate Hand Gesture Recognition with Complex Background;2024

2. Continuous word level sign language recognition using an expert system based on machine learning;International Journal of Cognitive Computing in Engineering;2023-06

3. Deep Learning for Highly Accurate Hand Recognition Based on Yolov7 Model;Big Data and Cognitive Computing;2023-03-22

4. Two-Dimensional Parallel Spatio-Temporal Pyramid Pooling for Hand Gesture Recognition;IEEE Access;2023

5. A Multi-scale Convolutional Neural Network for Skeleton-Based Human Action Recognition with Insufficient Training Samples;Lecture Notes in Electrical Engineering;2023