Author:
Patel Pradip,Narendra Patel
Abstract
Human Centered Computing is an emerging research field that aims to understand human behavior. Dynamic hand gesture recognition is one of the most recent, challenging and appealing application in this field. We have proposed one vision based system to recognize dynamic hand gestures for Indian Sign Language (ISL) in this paper. The system is built by using a unified architecture formed by combining Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM). In order to hit the shortage of a huge labeled hand gesture dataset, we have created two different CNN by retraining a well known image classification networks GoogLeNet and VGG16 using transfer learning. Frames of gesture videos are transformed into features vectors using these CNNs. As these videos are prearranged series of image frames, LSTM model have been used to join with the fully-connected layer of CNN. We have evaluated the system on three different datasets consisting of color videos with 11, 64 and 8 classes. During experiments it is found that the proposed CNN-LSTM architecture using GoogLeNet is fast and efficient having capability to achieve very high recognition rates of 93.18%, 97.50%, and 96.65% on the three datasets respectively.
Publisher
Perpetual Innovation Media Pvt. Ltd.
Reference32 articles.
1. Adithya, V. and Rajesh, R. 2020. Hand gestures for emergency situations: A video dataset based on words from indian sign language. Data in Brief Vol.31.
2. Chen, G. and Ge, K. 2020. A fusion recognition method based on multifeature hidden markov model for dynamic hand gesture. Computational Intelligence and Neuroscience No.8871605.
3. Dadashzadeh, A., Targhi, A., Tahmasbi, M., and Mirmehdi, M. 2019. Hgr-net: a fusion network for hand gesture segmentation and recognition. IET Computer Vision Vol.13, No.8.
4. Gangrade, J. and Bharti, J. 2020. Vision-based hand gesture recognition for indian sign language using convolution neural network. IETE Journal of Research Vol.31, pp.1–10.
5. Gers, F. A., Schmidhuber, J., and Cummins, F. 2000. Learning to forget: Continual pre- diction with lstm. Neural Computation Vol.12, pp.2451–2471.