A deep neural network model for multi-view human activity recognition-Reference-Cited by-同舟云学术

A deep neural network model for multi-view human activity recognition

Published:2022-01-07 Issue:1 Volume:17 Page:e0262181
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Putra Prasetia Utama^ORCID,Shima Keisuke,Shimatani Koji

Abstract

Multiple cameras are used to resolve occlusion problem that often occur in single-view human activity recognition. Based on the success of learning representation with deep neural networks (DNNs), recent works have proposed DNNs models to estimate human activity from multi-view inputs. However, currently available datasets are inadequate in training DNNs model to obtain high accuracy rate. Against such an issue, this study presents a DNNs model, trained by employing transfer learning and shared-weight techniques, to classify human activity from multiple cameras. The model comprised pre-trained convolutional neural networks (CNNs), attention layers, long short-term memory networks with residual learning (LSTMRes), and Softmax layers. The experimental results suggested that the proposed model could achieve a promising performance on challenging MVHAR datasets: IXMAS (97.27%) and i3DPost (96.87%). A competitive recognition rate was also observed in online classification.

Funder

Japan Society for the Promotion of Science

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference69 articles.

1. Single/multi-view human action recognition via regularized multi-task learning;AA Liu;Neurocomputing,2015

2. A framework of human detection and action recognition based on uniform segmentation and combination of Euclidean distance and joint entropy-based features selection;M Sharif;EURASIP Journal on Image and Video Processing,2017

3. An implementation of optimized framework for action classification using multilayers neural network on selected fused features;MA Khan;Pattern Analysis and Applications,2019

4. Baltieri D, Vezzani R, Cucchiara R. 3dpes: 3d people dataset for surveillance and forensics. In: Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding; 2011. p. 59–64.

5. A multiview multimodal system for monitoring patient sleep;C Torres;IEEE Transactions on Multimedia,2018

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Insights on the Distribution of Nonverbal and Verbal Oral Presentation Skills in an Educational Institution;SN Computer Science;2024-04-25

2. Anomalous Human Action Recognition with Deep Learning Technique;2024 11th International Conference on Computing for Sustainable Global Development (INDIACom);2024-02-28

3. A Survey of Motion Data Processing and Classification Techniques Based on Wearable Sensors;IgMin Research;2023-12-04

4. A survey on intelligent human action recognition techniques;Multimedia Tools and Applications;2023-11-11

5. 3D reconstruction of human bodies from single-view and multi-view images: A systematic review;Computer Methods and Programs in Biomedicine;2023-09