End-to-End Ultrasonic Hand Gesture Recognition-Reference-Cited by-同舟云学术

End-to-End Ultrasonic Hand Gesture Recognition

Published:2024-04-25 Issue:9 Volume:24 Page:2740
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Fertl Elfi¹²^ORCID,Nguyen Do Dinh Tan¹,Krueger Martin¹,Stettinger Georg¹,Padial-Allué Rubén²^ORCID,Castillo Encarnación²^ORCID,Cuéllar Manuel P.³^ORCID

Affiliation:

1. Infineon Technologies AG, 85579 Neubiberg, Germany

2. Department of Electronics and Computer Technology, University of Granada, 18071 Granada, Spain

3. Department of Computer Science and Artificial Intelligence, University of Granada, 18071 Granada, Spain

Abstract

As the number of electronic gadgets in our daily lives is increasing and most of them require some kind of human interaction, this demands innovative, convenient input methods. There are limitations to state-of-the-art (SotA) ultrasound-based hand gesture recognition (HGR) systems in terms of robustness and accuracy. This research presents a novel machine learning (ML)-based end-to-end solution for hand gesture recognition with low-cost micro-electromechanical (MEMS) system ultrasonic transducers. In contrast to prior methods, our ML model processes the raw echo samples directly instead of using pre-processed data. Consequently, the processing flow presented in this work leaves it to the ML model to extract the important information from the echo data. The success of this approach is demonstrated as follows. Four MEMS ultrasonic transducers are placed in three different geometrical arrangements. For each arrangement, different types of ML models are optimized and benchmarked on datasets acquired with the presented custom hardware (HW): convolutional neural networks (CNNs), gated recurrent units (GRUs), long short-term memory (LSTM), vision transformer (ViT), and cross-attention multi-scale vision transformer (CrossViT). The three last-mentioned ML models reached more than 88% accuracy. The most important innovation described in this research paper is that we were able to demonstrate that little pre-processing is necessary to obtain high accuracy in ultrasonic HGR for several arrangements of cost-effective and low-power MEMS ultrasonic transducer arrays. Even the computationally intensive Fourier transform can be omitted. The presented approach is further compared to HGR systems using other sensor types such as vision, WiFi, radar, and state-of-the-art ultrasound-based HGR systems. Direct processing of the sensor signals by a compact model makes ultrasonic hand gesture recognition a true low-cost and power-efficient input method.

Funder

Infineon Technologies AG

Bundesministerium für Wirtschaft und Energie

Publisher

MDPI AG

Link

https://www.mdpi.com/1424-8220/24/9/2740/pdf

Reference46 articles.

1. Future Trends and Current State of Smart City Concepts: A Survey;Kirimtat;IEEE Access,2020

2. Hamad, A., and Jia, B. (2022). How Virtual Reality Technology Has Changed Our Lives: An Overview of the Current and Potential Applications and Limitations. Int. J. Environ. Res. Public Health, 19.

3. Fu, J., Rota, A., Li, S., Zhao, J., Liu, Q., Iovene, E., Ferrigno, G., and De Momi, E. (2023). Recent Advancements in Augmented Reality for Robotic Applications: A Survey. Actuators, 12.

4. Human-Machine Interaction Sensing Technology Based on Hand Gesture Recognition: A Review;Guo;IEEE Trans. Human-Mach. Syst.,2021

5. Oudah, M., Al-Naji, A., and Chahl, J. (2020). Hand Gesture Recognition Based on Computer Vision: A Review of Techniques. J. Imaging, 6.