In-situ identification and recognition of multi-hand gestures using optimized deep residual network-Reference-Cited by-同舟云学术

In-situ identification and recognition of multi-hand gestures using optimized deep residual network

Published:2021-12-16 Issue:6 Volume:41 Page:6983-6997
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Rubin Bose S.¹,Sathiesh Kumar V.¹

Affiliation:

1. Department of Electronics Engineering, Madras Institute of Technology Campus, Anna University, Chennai, India

Abstract

The real-time perception of hand gestures in a deprived environment is a demanding machine vision task. The hand recognition operations are more strenuous with different illumination conditions and varying backgrounds. Robust recognition and classification are the vital steps to support effective human-machine interaction (HMI), virtual reality, etc. In this paper, the real-time hand action recognition is performed by using an optimized Deep Residual Network model. It incorporates a RetinaNet model for hand detection and a Depthwise Separable Convolutional (DSC) layer for precise hand gesture recognition. The proposed model overcomes the class imbalance problems encountered by the conventional single-stage hand action recognition algorithms. The integrated DSC layer reduces the computational parameters and enhances the recognition speed. The model utilizes a ResNet-101 CNN architecture as a Feature extractor. The model is trained and evaluated on the MITI-HD dataset and compared with the benchmark datasets (NUSHP-II, Senz-3D). The network achieved a higher Precision and Recall value for an IoU value of 0.5. It is realized that the RetinaNet-DSC model using ResNet-101 backbone network obtained higher Precision (99.21 %for AP0.5, 96.80%for AP0.75) for MITI-HD Dataset. Higher performance metrics are obtained for a value of γ= 2 and α= 0.25. The SGD with a momentum optimizer outperformed the other optimizers (Adam, RMSprop) for the datasets considered in the studies. The prediction time of the optimized deep residual network is 82 ms.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference28 articles.

1. Hand Gesture Recognition with Skin Detection and Deep Learning Method;Huang;IOP: Journal of Physics: Conf. Series,2019

2. Abavisani M. , Joze H.R.V. and Patel V.M. , Improving the performance of unimodal dynamic hand-gesture recognition with multimodal training, IEEE Explorer IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2019), 1165–1174.

3. Hand gesture recognition in automotive human–machine interaction using depth cameras;Zengeler;Sensors,2019

4. Tripathi P. , Keshari R. , Ghosh S. , Vatsa M. and Singh R. , AUTO-G: Gesture Recognition in the Crowd for Autonomous Vehicle, IEEE Explorer, International Conference on Image Processing (ICIP), Taipei, Taiwan, (2019), 3482–3486.

5. An efficient hand gesture recognition system based on deep CNN”;Chung;In Proceedings of the 2019 IEEE International Conference on Industrial Technology (ICIT), Melbourne, Australia,2019

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The use of CNNs in VR/AR/MR/XR: a systematic literature review;Virtual Reality;2024-08-30

2. Precise Hand Gesture Recognition under Hard Visual Environments using Deep Architecture;SN Computer Science;2024-01-20

3. In-situ enhanced anchor-free deep CNN framework for a high-speed human-machine interaction;Engineering Applications of Artificial Intelligence;2023-11

4. mIV3Net: modified inception V3 network for hand gesture recognition;Multimedia Tools and Applications;2023-06-23

5. Vision Based Real-Time Active Protection System Using Deep Convolutional Neural Network;2023 International Conference on Bio Signals, Images, and Instrumentation (ICBSII);2023-03-16