Part Aware Contrastive Learning for Self-Supervised Action Recognition

Author:

Hua Yilei1,Wu Wenhan2,Zheng Ce3,Lu Aidong2,Liu Mengyuan4,Chen Chen3,Wu Shiqian1

Affiliation:

1. School of Information Science and Engineering, Wuhan University of Science and Technology

2. University of North Carolina at Charlotte

3. Center for Research in Computer Vision,University of Central Florida

4. Peking University, Shenzhen Graduate School

Abstract

In recent years, remarkable results have been achieved in self-supervised action recognition using skeleton sequences with contrastive learning. It has been observed that the semantic distinction of human action features is often represented by local body parts, such as legs or hands, which are advantageous for skeleton-based action recognition. This paper proposes an attention-based contrastive learning framework for skeleton representation learning, called SkeAttnCLR, which integrates local similarity and global features for skeleton-based action representations. To achieve this, a multi-head attention mask module is employed to learn the soft attention mask features from the skeletons, suppressing non-salient local features while accentuating local salient features, thereby bringing similar local features closer in the feature space. Additionally, ample contrastive pairs are generated by expanding contrastive pairs based on salient and non-salient features with global features, which guide the network to learn the semantic representations of the entire skeleton. Therefore, with the attention mask mechanism, SkeAttnCLR learns local features under different data augmentation views. The experiment results demonstrate that the inclusion of local feature similarity significantly enhances skeleton-based action representation. Our proposed SkeAttnCLR outperforms state-of-the-art methods on NTURGB+D, NTU120-RGB+D, and PKU-MMD datasets. The code and settings are available at this repository: https://github.com/GitHubOfHyl97/SkeAttnCLR.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A lightweight attention-driven distillation model for human pose estimation;Pattern Recognition Letters;2024-09

2. Intelligent Surveillance of Airport Apron: Detection and Location of Abnormal Behavior in Typical Non-Cooperative Human Objects;Applied Sciences;2024-07-16

3. Edge-Joint Assisted and Salient Enhanced Self-Supervised Action Recognition;2024 IEEE 14th International Conference on Electronics Information and Emergency Communication (ICEIEC);2024-05-24

4. Self-supervised action representation learning from partial consistency skeleton sequences;Neural Computing and Applications;2024-04-21

5. LAMP: Leveraging Language Prompts for Multi-Person Pose Estimation;2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS);2023-10-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3