Affiliation:
1. Institute of Neural Information Processing, Ulm University, 89081 Ulm, Germany
2. Computer Science Engineering Department, German University in Cairo, Cairo 11835, Egypt
Abstract
Hand gestures are an essential part of human-to-human communication and are therefore of growing interest for technical applications. Human-computer interaction increasingly aims to be as natural as possible, for example through natural language or hand gestures, and these modalities are receiving more and more attention in human-machine interaction research. Realizing natural communication between humans and computers nevertheless remains a major challenge. In hand gesture recognition, some approaches rely on additional hardware, such as special gloves, to classify gestures with high accuracy. More recently, deep learning techniques based on artificial neural networks have been proposed to recognize gestures without such tools. In this context, we examine convolutional neural networks (CNNs) in detail for the task of hand gesture recognition. A CNN is a deep neural network suited to visual object processing and classification. The goal of this work is to recognize ten types of static hand gestures from raw images, against complex backgrounds and with varying hand sizes, without the use of extra hardware. We achieved good results with a CNN architecture consisting of seven layers. Data augmentation and skin segmentation significantly increased the model's accuracy. On two challenging public benchmark datasets, the model reached testing accuracies of 96.5% and 96.57%.
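The abstract names skin segmentation and data augmentation as preprocessing steps but does not say how they were implemented. The following is a minimal sketch of what such steps might look like, not the authors' method: it assumes a common fixed RGB threshold heuristic for skin pixels, and a random horizontal flip plus brightness scaling as the augmentation. The function names (`skin_mask`, `segment_skin`, `augment`) and all threshold and gain values are illustrative assumptions.

```python
import numpy as np


def skin_mask(rgb):
    """Boolean mask of pixels passing a simple RGB skin heuristic.

    Assumption: fixed thresholds on an H x W x 3 uint8 RGB image; the
    paper's actual segmentation rule is not given in the abstract.
    """
    px = rgb.astype(np.int16)  # signed type so differences cannot wrap around
    r, g, b = px[..., 0], px[..., 1], px[..., 2]
    spread = px.max(axis=-1) - px.min(axis=-1)
    return (
        (r > 95) & (g > 40) & (b > 20)
        & (spread > 15)
        & (np.abs(r - g) > 15)
        & (r > g) & (r > b)
    )


def segment_skin(rgb):
    """Return a copy of the image with non-skin pixels set to zero."""
    keep = skin_mask(rgb)[..., None]  # broadcast the mask over the channel axis
    return np.where(keep, rgb, 0).astype(rgb.dtype)


def augment(rgb, rng, gain_range=(0.8, 1.2)):
    """Random horizontal flip plus brightness scaling (hypothetical choices)."""
    out = rgb[:, ::-1] if rng.random() < 0.5 else rgb
    gain = rng.uniform(*gain_range)
    scaled = out.astype(np.float32) * gain
    return np.clip(scaled, 0, 255).astype(np.uint8)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)
    sample = segment_skin(augment(image, rng))
    print(sample.shape, sample.dtype)
```

Whether flipping is appropriate depends on the gesture set, since mirroring can change the meaning of some signs; a fixed RGB rule is also sensitive to lighting, which is one reason a learned or color-space-based segmentation might be preferred in practice.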
Subject
Computational Mathematics, Computational Theory and Mathematics, Numerical Analysis, Theoretical Computer Science
Cited by
5 articles.