Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments-Reference-Cited by-同舟云学术

Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments

Published:2020-11-07 Issue:21 Volume:20 Page:6360
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Xu Chi^ORCID,Zhou Jun^ORCID,Cai Wendi^ORCID,Jiang Yunkai^ORCID,Li Yongbo^ORCID,Liu Yi^ORCID

Abstract

Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as light changes. To tackle this problem, we propose a 3D hand detection approach which improves the robustness and accuracy by adaptively fusing the complementary features extracted from the RGB-D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we represent a challenging RGB-D hand detection dataset collected in unconstrained environments. Different from previous works which primarily rely on either the RGB or D channel, we adaptively fuse the RGB-D channels for hand detection. Specifically, evaluation results show that the D-channel is crucial for hand detection in unconstrained environments. Our RGB-D fusion-based approach significantly improves the hand detection accuracy from 69.1 to 74.1 comparing to one of the most state-of-the-art RGB-based hand detectors. The existing RGB- or D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method significantly drops to 48.9, and in back-light conditions, the accuracy of the D-based method dramatically drops to 28.3. Compared with these methods, our RGB-D fusion based approach is much more robust without accuracy degrading, and our detection results are 62.5 and 65.9, respectively, in these two extreme lighting conditions for accuracy.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/21/6360/pdf

Reference69 articles.

1. Human-Computer Interaction in Smart Environments

2. Lie-X: Depth Image Based Articulated Object Pose Estimation, Tracking, and Action Recognition on Lie Groups

3. Real-time gesture recognition by learning and selective control of visual interest points

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Challenges and solutions for vision-based hand gesture interpretation: A review;Computer Vision and Image Understanding;2024-11

2. Dynamic Importance-Weighted Fusion Network Based on Dynamic Convolutions for Hand Posture Recognition: A Technique Based on Red, Green, Blue Plus Depth Cameras;IEEE Robotics & Automation Magazine;2024

3. Sample-Adapt Fusion Network for RGB-D Hand Detection in the Wild;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04

4. SA-Fusion: Multimodal Fusion Approach for Web-based Human-Computer Interaction in the Wild;Proceedings of the ACM Web Conference 2023;2023-04-30

5. Autonomous recognition and positioning of shield segments based on red, green, blue and depth information;Automation in Construction;2023-02