Prior Knowledge-guided Hierarchical Action Quality Assessment with 3D Convolution and Attention Mechanism-Reference-Cited by-同舟云学术

Prior Knowledge-guided Hierarchical Action Quality Assessment with 3D Convolution and Attention Mechanism

Published:2023-11-01 Issue:1 Volume:2632 Page:012027
ISSN:1742-6588
Container-title:Journal of Physics: Conference Series
language:
Short-container-title:J. Phys.: Conf. Ser.

Author:

Zhou Haoyang,Hou Teng,Li Jitao

Abstract

Abstract Recently, there has been a growing interest in the field of computer vision and deep learning regarding a newly emerging problem known as action quality assessment (AQA). However, most researchers still rely on the traditional approach of using models from the video action recognition field. Unfortunately, this approach overlooks crucial features in AQA, such as movement fluency and degree of completion. Alternatively, some researchers have employed the transformer paradigm to capture action details and overall action integrity, but the high computational cost associated with transformers makes them impractical for real-time tasks. Due to the diversity of action types, it is challenging to rely solely on a shared model for quality assessment of various types of actions. To address these issues, we propose a novel network structure for AQA, which is the first to integrate multi-model capabilities through a classification model. Specifically, we utilize a pre-trained I3D model equipped with a self-attention block for classification. This allows us to evaluate various categories of actions using just one model. Furthermore, we introduce self-attention mechanisms and multi-head attention into the traditional convolutional neural network. By systematically replacing the last few layers of the conventional convolutional network, our model gains a greater ability to sense the global coordination of different actions. We have verified the effectiveness of our approach on the AQA-7 dataset. In comparison to other popular models, our model achieves satisfactory performance while maintaining a low computational cost.

Publisher

IOP Publishing

Subject

Computer Science Applications,History,Education

Link

https://iopscience.iop.org/article/10.1088/1742-6596/2632/1/012027/pdf

Reference19 articles.

1. Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks

2. Temporal clustering of surgical activities in robot-assisted surgery;Zia;Int J Comput Assist Radiol Surg,2017

3. What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment

4. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. GYMetricPose: A light-weight angle-based graph adaptation for action quality assessment;2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS);2024-06-26