Affiliation:
1. Department of Multimedia and Information-Communication Technologies, University of Zilina, 010 26 Zilina, Slovakia
Abstract
Currently, three-dimensional convolutional neural networks (3DCNNs) are a popular approach in the field of human activity recognition. However, due to the variety of methods used for human activity recognition, we propose a new deep-learning model in this paper. The main objective of our work is to optimize the traditional 3DCNN and propose a new model that combines 3DCNN with Convolutional Long Short-Term Memory (ConvLSTM) layers. Our experimental results, which were obtained using the LoDVP Abnormal Activities dataset, UCF50 dataset, and MOD20 dataset, demonstrate the superiority of the 3DCNN + ConvLSTM combination for recognizing human activities. Furthermore, our proposed model is well-suited for real-time human activity recognition applications and can be further enhanced by incorporating additional sensor data. To provide a comprehensive comparison of our proposed 3DCNN + ConvLSTM architecture, we compared our experimental results on these datasets. We achieved a precision of 89.12% when using the LoDVP Abnormal Activities dataset. Meanwhile, the precision we obtained using the modified UCF50 dataset (UCF50mini) and MOD20 dataset was 83.89% and 87.76%, respectively. Overall, our work demonstrates that the combination of 3DCNN and ConvLSTM layers can improve the accuracy of human activity recognition tasks, and our proposed model shows promise for real-time applications.
Funder
Slovak Research and Development Agency
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference22 articles.
1. An enhanced 3DCNN-ConvLSTM for spatiotemporal multimedia data analysis;Wang;Concurr. Comput. Pract. Exp.,2021
2. 3DCNN Backed Conv-LSTM Auto Encoder for Micro Facial Expression Video Recognition;Islam;Mach. Learn. Intell. Commun.,2021
3. Continuous Gesture Segmentation and Recognition using 3DCNN and Convolutional LSTM;Zhu;IEEE Trans. Multimed.,2019
4. GssMILP for anomaly classification in surveillance videos;Krishna;IEEE Expert Syst. Appl.,2022
5. Pediaditis, M., Farmaki, C., Schiza, S., Tzanakis, N., Galanakis, E., and Sakkalis, V. (2022, January 21–23). Contactless respiratory rate estimation from video in a real-life clinical environment using eulerian magnification and 3D CNNs. Proceedings of the IEEE International Conference on Imaging Systems and Techniques, Kaohsiung, Taiwan.
Cited by
22 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献