Static Video Summarization Using Video Coding Features with Frame-Level Temporal Subsampling and Deep Learning

Published:2023-05-15 Issue:10 Volume:13 Page:6065
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Issa Obada¹, Shanableh Tamer¹

Affiliation:

1. Department of Computer Science and Engineering, American University of Sharjah, Sharjah P.O. Box 26666, United Arab Emirates

Abstract

There is an abundance of digital video content due to the cloud’s phenomenal growth and security footage; it is therefore essential to summarize these videos in data centers. This paper offers innovative approaches to the problem of key frame extraction for the purpose of video summarization. Our approach includes the extraction of feature variables from the bit streams of coded videos, followed by optional stepwise regression for dimensionality reduction. Once the features are extracted and their dimensionality is reduced, we apply innovative frame-level temporal subsampling techniques, followed by training and testing using deep learning architectures. The frame-level temporal subsampling techniques are based on cosine similarity and the PCA projections of feature vectors. We create three different learning architectures by utilizing LSTM networks, 1D-CNN networks, and random forests. The four most popular video summarization datasets, namely, TVSum, SumMe, OVP, and VSUMM, are used to evaluate the accuracy of the proposed solutions. This includes the precision, recall, F-score measures, and computational time. It is shown that the proposed solutions, when trained and tested on all subjective user summaries, achieved F-scores of 0.79, 0.74, 0.88, and 0.81, respectively, for the aforementioned datasets, showing clear improvements over prior studies.
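The abstract states that the frame-level temporal subsampling relies on cosine similarity between feature vectors, without giving the exact procedure. As a rough illustration only, the sketch below shows one plausible way such subsampling could work: a frame is kept when it differs enough from the most recently kept frame. The function name `subsample_by_cosine`, the `threshold` parameter, and the toy feature vectors are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def subsample_by_cosine(features, threshold=0.95):
    """Return indices of frames kept after dropping near-duplicate frames.

    features: (n_frames, n_dims) array of per-frame feature vectors.
    threshold: hypothetical similarity cutoff; a frame is kept only when its
        cosine similarity to the most recently kept frame is below this value.
    """
    features = np.asarray(features, dtype=float)
    if features.shape[0] == 0:
        return []

    # Normalize each row to unit length so a dot product equals cosine similarity.
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)  # guard against zero vectors

    kept = [0]  # always keep the first frame
    for i in range(1, len(unit)):
        similarity = float(unit[i] @ unit[kept[-1]])
        if similarity < threshold:
            kept.append(i)
    return kept


# Toy example: frames 1 and 3 are near-duplicates of frames 0 and 2.
demo = np.array([
    [1.0, 0.0],
    [0.99, 0.01],
    [0.0, 1.0],
    [0.01, 0.99],
    [1.0, 1.0],
])
kept_demo = subsample_by_cosine(demo, threshold=0.95)
print(kept_demo)  # [0, 2, 4]
```

Comparing against the last kept frame, rather than the immediately preceding frame, avoids discarding a slow drift in content one small step at a time. Whether the paper makes the same choice is not stated in the abstract.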

Funder

American University of Sharjah

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/10/6065/pdf

References: 52 articles.

1. Survey of Compressed Domain Video Summarization Techniques;Basavarajaiah;ACM Comput. Surv.,2020

2. Apostolidis, E., Adamantidou, E., Metsai, A.I., Mezaris, V., and Patras, I. (2021). Video Summarization Using Deep Neural Networks: A Survey. arXiv.

3. Dimensionality reduction: A comparative study;Postma;J. Mach. Learn. Res.,2009

4. Random Forests;Breiman;Mach. Learn.,2001

5. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2014). Going Deeper with Convolutions. arXiv.

Cited by 12 articles.

1. Static video summarization based on genetic algorithm and deep learning approach;Multimedia Tools and Applications;2024-06-03

2. A deep audio-visual model for efficient dynamic video summarization;Journal of Visual Communication and Image Representation;2024-04

3. Static video summarization with multi-objective constrained optimization;Journal of Ambient Intelligence and Humanized Computing;2024-04

4. Image Caption Generation using Deep Learning For Video Summarization Applications;International Journal of Advanced Computer Science and Applications;2024

5. Method of Coding Video Images Based on Meta-Determination of Segments;Lecture Notes in Electrical Engineering;2024