Abstract
AbstractRobust vision-based hand pose estimation is highly sought but still remains a challenging task, due to its inherent difficulty partially caused by self-occlusion among hand fingers. In this paper, an innovative framework for real-time static hand gesture recognition is introduced, based on an optimized shape representation build from multiple shape cues. The framework incorporates a specific module for hand pose estimation based on depth map data, where the hand silhouette is first extracted from the extremely detailed and accurate depth map captured by a time-of-flight (ToF) depth sensor. A hybrid multi-modal descriptor that integrates multiple affine-invariant boundary-based and region-based features is created from the hand silhouette to obtain a reliable and representative description of individual gestures. Finally, an ensemble of one-vs.-all support vector machines (SVMs) is independently trained on each of these learned feature representations to perform gesture classification. When evaluated on a publicly available dataset incorporating a relatively large and diverse collection of egocentric hand gestures, the approach yields encouraging results that agree very favorably with those reported in the literature, while maintaining real-time operation.
Funder
Bundesministerium f?r Bildung und Forschung
Publisher
Springer Science and Business Media LLC
Subject
Electrical and Electronic Engineering,Information Systems,Signal Processing
Reference40 articles.
1. S. Bakheet, A. Al-Hamadi, Hand gesture recognition using optimized local gabor features. J. Comput. Theor. Nanosci.14(2), 1–10 (2017).
2. S. K. Leem, F. Khan, S. H. Cho, Detecting mid-air gestures for digit writing with radio sensors and a CNN. IEEE Trans. Instrum. Meas.69(4), 1066–1081 (2020). https://doi.org/10.1109/TIM.2019.2909249.
3. R. Faugeroux, T. Vieira, D. Martinez, T. Lewiner, in 27th SIBGRAPI Conference on Graphics, Patterns and Images. Simplified training for gesture recognition, (2014), pp. 133–140. https://doi.org/10.1109/SIBGRAPI.2014.46.
4. J. Bransford, How people learn: brain, mind, experience, and school: expanded edition (2000) (National Academies Press, Washington, DC, 2000).
5. S. Riofrio, D. Pozo, J. Rosero, J. Vasquez. Gesture recognition using dynamic time warping and kinect: a practical approach, (2017), pp. 302–308. https://doi.org/10.1109/INCISCOS.2017.36.
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献