Continuous Human Action Recognition for Human-machine Interaction: A Review-Reference-Cited by-同舟云学术

Continuous Human Action Recognition for Human-machine Interaction: A Review

Published:2023-07-13 Issue:13s Volume:55 Page:1-38
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Gammulle Harshala¹^ORCID,Ahmedt-Aristizabal David²^ORCID,Denman Simon¹^ORCID,Tychsen-Smith Lachlan²^ORCID,Petersson Lars²^ORCID,Fookes Clinton¹^ORCID

Affiliation:

1. Queensland University of Technology, Australia

2. CSIRO Data61, Australia

Abstract

With advances in data-driven machine learning research, a wide variety of prediction models have been proposed to capture spatio-temporal features for the analysis of video streams. Recognising actions and detecting action transitions within an input video are challenging but necessary tasks for applications that require real-time human-machine interaction. By reviewing a large body of recent related work in the literature, we thoroughly analyse, explain, and compare action segmentation methods and provide details on the feature extraction and learning strategies that are used on most state-of-the-art methods. We cover the impact of the performance of object detection and tracking techniques on human action segmentation methodologies. We investigate the application of such models to real-world scenarios and discuss several limitations and key research directions towards improving interpretability, generalisation, optimisation, and deployment.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3587931

Reference145 articles.

1. David Ahmedt-Aristizabal Mohammad Ali Armin Simon Denman Clinton Fookes and Lars Petersson. 2022. A survey on graph-based deep learning for computational histopathology. Computerized Medical Imaging and Graphics 95 (2022) 102027.

2. Refining Action Segmentation with Hierarchical Video Representations

3. AlexeyAB. 2021. Darknet: Open Source Neural Networks in C. Retrieved from https://github.com/AlexeyAB/darknet.

4. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling;Bai Shaojie;arXiv preprint arXiv:1803.01271,2018

5. Tracking Without Bells and Whistles

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Thermal infrared action recognition with two-stream shift Graph Convolutional Network;Machine Vision and Applications;2024-05-13

2. Deep learning approaches for seizure video analysis: A review;Epilepsy & Behavior;2024-05

3. Spatio-Temporal Correlation Learning for Multiple Object Tracking;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

4. Action recognition in compressed domains: A survey;Neurocomputing;2024-04

5. Enhancing early action prediction in videos through temporal composition of sub-actions;Multimedia Tools and Applications;2024-03-18