Unsupervised Learning of Deep Feature Representation for Clustering Egocentric Actions

Author:

Bhatnagar Bharat Lal1,Singh Suriya1,Arora Chetan2,Jawahar C.V.1

Affiliation:

1. CVIT, KCIS, International Institute of Information Technology, Hyderabad

2. Indraprastha Institute of Information Technology, Delhi

Abstract

Popularity of wearable cameras in life logging, law enforcement, assistive vision and other similar applications is leading to explosion in generation of egocentric video content. First person action recognition is an important aspect of automatic analysis of such videos. Annotating such videos is hard, not only because of obvious scalability constraints, but also because of privacy issues often associated with egocentric videos. This motivates the use of unsupervised methods for egocentric video analysis. In this work, we propose a robust and generic unsupervised approach for first person action clustering. Unlike the contemporary approaches, our technique is neither limited to any particular class of actions nor requires priors such as pre-training, fine-tuning, etc. We learn time sequenced visual and flow features from an array of weak feature extractors based on convolutional and LSTM autoencoder networks. We demonstrate that clustering of such features leads to the discovery of semantically meaningful actions present in the video. We validate our approach on four disparate public egocentric actions datasets amounting to approximately 50 hours of videos. We show that our approach surpasses the supervised state of the art accuracies without using the action labels.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 16 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Interaction Replica: Tracking Human–Object Interaction and Scene Changes From Human Motion;2024 International Conference on 3D Vision (3DV);2024-03-18

2. SEMA: Semantic Attention for Capturing Long-Range Dependencies in Egocentric Lifelogs;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03

3. Badminton Shot Recognition with LSTM Network;Lecture Notes in Networks and Systems;2024

4. Unsupervised Deep Learning for IoT Time Series;IEEE Internet of Things Journal;2023-08-15

5. Towards Automated Ethogramming: Cognitively-Inspired Event Segmentation for Streaming Wildlife Video Monitoring;International Journal of Computer Vision;2023-04-28

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3