Author:
Koh Thean Chun, Yeo Chai Kiat, Jing Xuan, Sivadas Sunil
Abstract
Given the prevalence of surveillance cameras in our daily lives, human action recognition from videos holds significant practical value. A persistent challenge in this field is to develop more efficient models capable of real-time recognition with high accuracy for widespread deployment. In this paper, we introduce a novel human action recognition model named Context-Aware Memory Attention Network (CAMA-Net), which eliminates the need for optical flow extraction and 3D convolution, both of which are computationally intensive. By removing these components, CAMA-Net achieves superior computational efficiency compared to many existing approaches. A pivotal component of CAMA-Net is the Context-Aware Memory Attention Module, an attention module that computes relevance scores between key-value pairs obtained from the 2D ResNet backbone, thereby establishing correspondences between video frames. To validate our method, we conduct experiments on four well-known action recognition datasets: ActivityNet, Diving48, HMDB51 and UCF101. The experimental results demonstrate the effectiveness of the proposed model, which surpasses existing 2D-CNN based baseline models.
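The abstract does not give implementation details, so the following is a minimal, hypothetical PyTorch sketch of cross-frame key-value attention in the spirit described above: per-frame features from a 2D backbone are projected into queries, keys and values, relevance scores are computed between queries and keys across all frames, and the values are aggregated by those scores. The class name CrossFrameAttention, the projection dimension dim and every other specific here are illustrative assumptions, not the authors' actual CAMA-Net module.

import torch
import torch.nn as nn
import torch.nn.functional as F

class CrossFrameAttention(nn.Module):
    # Hypothetical sketch: scores each frame's queries against the keys
    # of all frames and aggregates values, establishing correspondences
    # between video frames without optical flow or 3D convolution.
    def __init__(self, channels: int, dim: int = 128):
        super().__init__()
        self.to_query = nn.Conv2d(channels, dim, kernel_size=1)
        self.to_key = nn.Conv2d(channels, dim, kernel_size=1)
        self.to_value = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (B, T, C, H, W) per-frame features from a 2D CNN backbone
        b, t, c, h, w = feats.shape
        x = feats.reshape(b * t, c, h, w)
        q = self.to_query(x).reshape(b, t, -1, h * w)  # (B, T, D, HW)
        k = self.to_key(x).reshape(b, t, -1, h * w)
        v = self.to_value(x).reshape(b, t, c, h * w)

        # Flatten time and space so every position attends across frames.
        q = q.permute(0, 1, 3, 2).reshape(b, t * h * w, -1)  # (B, THW, D)
        k = k.permute(0, 1, 3, 2).reshape(b, t * h * w, -1)
        v = v.permute(0, 1, 3, 2).reshape(b, t * h * w, c)

        # Scaled relevance scores between queries and keys, then aggregate.
        attn = F.softmax(q @ k.transpose(1, 2) / q.shape[-1] ** 0.5, dim=-1)
        out = attn @ v  # (B, THW, C)
        out = out.reshape(b, t, h * w, c).permute(0, 1, 3, 2)
        return feats + out.reshape(b, t, c, h, w)  # residual connection

Because the projections are 1x1 convolutions over 2D feature maps, temporal reasoning comes entirely from the attention step, which is the property the abstract attributes to the memory attention design.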
Article Highlights
Recent human action recognition models are not yet ready for practical applications due to their high computational demands.
We propose a 2D CNN-based human action recognition method to reduce the computation load.
The proposed method achieves competitive performance against most state-of-the-art 2D CNN-based methods on public datasets.
Funder
RIE2020 Industry Alignment Fund – Industry Collaboration Projects
Publisher
Springer Science and Business Media LLC
Subject
General Earth and Planetary Sciences, General Physics and Astronomy, General Engineering, General Environmental Science, General Materials Science, General Chemical Engineering
Cited by
3 articles.