Egocentric Video Summarization Based on People Interaction Using Deep Learning

Author:

Ghafoor Humaira A.1,Javed Ali1ORCID,Irtaza Aun2,Dawood Hassan1,Dawood Hussain3,Banjar Ameen3

Affiliation:

1. Department of Software Engineering, University of Engineering and Technology, Taxila 47050, Pakistan

2. Department of Computer Science, University of Engineering and Technology, Taxila 47050, Pakistan

3. Faculty of Computing and Information Technology, University of Jeddah, Saudi Arabia

Abstract

The availability of wearable cameras in the consumer market has motivated the users to record their daily life activities and post them on the social media. This exponential growth of egocentric videos demand to develop automated techniques to effectively summarizes the first-person video data. Egocentric videos are commonly used to record lifelogs these days due to the availability of low cost wearable cameras. However, egocentric videos are challenging to process due to the fact that placement of camera results in a video which presents great deal of variation in object appearance, illumination conditions, and movement. This paper presents an egocentric video summarization framework based on detecting important people in the video. The proposed method generates a compact summary of egocentric videos that contains information of the people whom the camera wearer interacts with. Our proposed approach focuses on identifying the interaction of camera wearer with important people. We have used AlexNet convolutional neural network to filter the key-frames (frames where camera wearer interacts closely with the people). We used five convolutional layers and two completely connected hidden layers and an output layer. Dropout regularization method is used to reduce the overfitting problem in completely connected layers. Performance of the proposed method is evaluated onUT Egostandard dataset. Experimental results signify the effectiveness of the proposed method in terms of summarizing the egocentric videos.

Publisher

Hindawi Limited

Subject

General Engineering,General Mathematics

Cited by 11 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Dynamic video summarisation using stacked encoder-decoder architecture with residual learning network;International Journal of Intelligent Engineering Informatics;2024

2. Towards machine vision-based video analysis in smart cities: a survey, framework, applications and open issues;Multimedia Tools and Applications;2023-08-09

3. CoSumNet: A video summarization-based framework for COVID-19 monitoring in crowded scenes;Artificial Intelligence in Medicine;2023-05

4. Data-driven enabled approaches for criteria-based video summarization: a comprehensive survey, taxonomy, and future directions;Multimedia Tools and Applications;2023-03-02

5. Making a long video short: A Systematic Review;Proceedings of the 4th International Conference on Information Management & Machine Intelligence;2022-12-23

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3