FEASE: Feature Selection and Enhancement Networks for Action Recognition-Reference-Cited by-同舟云学术

FEASE: Feature Selection and Enhancement Networks for Action Recognition

Published:2024-03-06 Issue:2 Volume:56 Page:
ISSN:1573-773X
Container-title:Neural Processing Letters
language:en
Short-container-title:Neural Process Lett

Author:

Zhou Lu,Lu Yuanyao,Jiang Haiyang

Abstract

AbstractReinforcement of motor features is necessary in action recognition tasks. In this work, we propose an efficient feature reinforcement model, termed as Feature Selection and Enhancement Networks (FEASE-Net). The core of our FEASE-Net is the use of the FEASE module to adaptively capture input features at multi-scales and reinforce them globally. FEASE module is composed of two sub-module, Feature Selection (FS) and Feature Enhancement (FE). The FS focuses on adaptive attention and selection of input features through a multi-scale structure with an attention mechanism, and FE employs channel attention to enhance the global useful feature information. To assess the effectiveness of FEASE-Net, we undertake a series of extensive experiments on two benchmark datasets, namely Kinetics 400 and Something-Something V2. Our proposed FEASE-Net can achieve a competitive performance compared with previous state-of-the-art methods that use similar backbones.

Funder

National Natural Science Foundation of China

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11063-024-11547-7.pdf

Reference58 articles.

1. Feichtenhofer C, Fan H, Malik J, He K (2018) Slowfast networks for video recognition. IEEE/CVF Int Conf Comput Vision (ICCV) 2019:6201–6210

2. Koohzadi M, Charkari NM (2020) A context based deep temporal embedding network in action recognition. Neural Process Lett 52:187–220

3. Li B, Pan Y-T, Liu R, Zhu Y (2023) Separately guided context-aware network for weakly supervised temporal action detection. Neural Process Lett. https://doi.org/10.1007/s11063-022-11138-4

4. Karpathy A, Toderici G, Shetty S, Leung T, Sukthankar R, Fei-Fei L (2014) Large-scale video classification with convolutional neural networks. IEEE Conf Comput Visi Patt Recognit 2014:1725–1732

5. Feichtenhofer C, Pinz A, Wildes RP (2017) Spatiotemporal multiplier networks for video action recognition. IEEE Conf Comput Vision Patt Recognit (CVPR) 2017:7445–7454

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient spatio-temporal network for action recognition;Journal of Real-Time Image Processing;2024-08-23