Abstract
Gesture spotting is an essential task for recognizing finger gestures used to control in-car touchless interfaces. Automated methods for this task must detect the video segments in which gestures occur, discard natural hand movements that may look like target gestures, and operate online. In this paper, we address these challenges with a recurrent neural architecture for online finger gesture spotting. We propose a multi-stream network that merges hand and hand-location features, which help discriminate target gestures from natural hand movements, since the two may not occur in the same 3D spatial location. Our multi-stream recurrent neural network (RNN) recurrently learns semantic information, allowing gestures to be spotted online in long untrimmed video sequences. To validate our method, we collected a finger gesture dataset in an in-vehicle scenario of an autonomous car: 226 videos with more than 2100 continuous instances were captured with a depth sensor. On this dataset, our gesture spotting approach outperforms state-of-the-art methods, improving recall and precision by about 10% and 15%, respectively. Furthermore, we demonstrate that, when combined with an existing gesture classifier (a 3D convolutional neural network), our proposal achieves better performance than previous hand gesture recognition methods.
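To make the described architecture concrete, below is a minimal PyTorch sketch of a two-stream recurrent spotter in the spirit of the abstract: one stream encodes hand-appearance features, the other encodes 3D hand-location features, and their fused representation feeds a GRU that emits a per-frame gesture/background score. All layer names, feature dimensions, and the choice of GRU are illustrative assumptions, not the authors' exact model.

```python
import torch
import torch.nn as nn

class MultiStreamGestureSpotter(nn.Module):
    """Illustrative two-stream recurrent spotter (assumed design):
    hand-appearance and hand-location streams are projected, fused,
    and passed through a GRU for frame-level gesture spotting."""

    def __init__(self, hand_feat_dim=512, loc_feat_dim=3, hidden_dim=256):
        super().__init__()
        # Hypothetical per-stream projection layers.
        self.hand_proj = nn.Linear(hand_feat_dim, hidden_dim)
        self.loc_proj = nn.Linear(loc_feat_dim, hidden_dim)
        # Recurrent layer accumulates temporal context frame by frame.
        self.rnn = nn.GRU(2 * hidden_dim, hidden_dim, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, 2)  # gesture vs. background

    def forward(self, hand_feats, loc_feats, state=None):
        # hand_feats: (batch, time, hand_feat_dim) appearance features
        # loc_feats:  (batch, time, loc_feat_dim) 3D hand positions
        fused = torch.cat(
            [self.hand_proj(hand_feats), self.loc_proj(loc_feats)], dim=-1
        )
        # Carrying the hidden state across calls enables online spotting.
        out, state = self.rnn(fused, state)
        return self.classifier(out), state

# Online usage: feed one frame at a time, reusing the hidden state.
model = MultiStreamGestureSpotter()
state = None
frame_hand = torch.randn(1, 1, 512)  # current-frame appearance features
frame_loc = torch.randn(1, 1, 3)     # current-frame 3D hand location
logits, state = model(frame_hand, frame_loc, state)
```

The key design point this sketch captures is that the hidden state is threaded through successive calls, so the spotter can process an untrimmed stream frame by frame without buffering the whole video.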
Subject
Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry
Cited by
11 articles.