Explore human parsing modality for action recognition

Author:

Liu Jinfu1ORCID,Ding Runwei23,Wen Yuhang1ORCID,Dai Nan4,Meng Fanyang2,Zhang Fang‐Lue5,Zhao Shen1,Liu Mengyuan3

Affiliation:

1. School of Intelligent Systems Engineering Sun Yat‐sen University Shenzhen China

2. Peng Cheng Laboratory Shenzhen China

3. State Key Laboratory of General Artificial Intelligence Peking University Shenzhen Graduate School Shenzhen China

4. Changchun University of Science and Technology Changchun China

5. Victoria University of Wellington Wellington New Zealand

Abstract

AbstractMultimodal‐based action recognition methods have achieved high success using pose and RGB modality. However, skeletons sequences lack appearance depiction and RGB images suffer irrelevant noise due to modality limitations. To address this, the authors introduce human parsing feature map as a novel modality, since it can selectively retain effective semantic features of the body parts while filtering out most irrelevant noise. The authors propose a new dual‐branch framework called ensemble human parsing and pose network (EPP‐Net), which is the first to leverage both skeletons and human parsing modalities for action recognition. The first human pose branch feeds robust skeletons in the graph convolutional network to model pose features, while the second human parsing branch also leverages depictive parsing feature maps to model parsing features via convolutional backbones. The two high‐level features will be effectively combined through a late fusion strategy for better action recognition. Extensive experiments on NTU RGB + D and NTU RGB + D 120 benchmarks consistently verify the effectiveness of our proposed EPP‐Net, which outperforms the existing action recognition methods. Our code is available at https://github.com/liujf69/EPP‐Net‐Action.

Publisher

Institution of Engineering and Technology (IET)

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. HDBN: A Novel Hybrid Dual-Branch Network for Robust Skeleton-Based Action Recognition;2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW);2024-07-15

2. SEMIPL: A Semi-Supervised Method for Event Sound Source Localization;2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW);2024-07-15

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3