Abstract
Deep summarization models have succeeded in video summarization thanks to advances in gated recurrent unit (GRU) and long short-term memory (LSTM) technology. However, for long videos, GRUs and LSTMs cannot effectively capture long-term dependencies. This paper proposes a deep summarization network with auxiliary summarization losses to address this problem. We introduce an unsupervised auxiliary summarization loss module, combining an LSTM with the swish activation function, to capture long-term dependencies for video summarization; the module can be easily integrated with various networks. The proposed model is an unsupervised deep reinforcement learning framework that does not depend on any labels or user interactions. Additionally, we design a reward function R(S) that jointly accounts for the consistency, diversity, and representativeness of generated summaries. The proposed model is also lightweight and can be deployed on mobile devices, enhancing the experience of mobile users and reducing the load on servers. Experiments on two benchmark datasets demonstrate that our unsupervised approach obtains better summaries than existing video summarization methods. Moreover, the proposed algorithm achieves higher F-scores, with gains of nearly 6.3% on the SumMe dataset and 2.2% on the TVSum dataset over the DR-DSN model.
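For illustration only, the sketch below shows the kind of diversity and representativeness rewards used in DR-DSN-style reinforcement learning summarizers, together with the swish activation mentioned above. The abstract does not give the exact formulation of R(S) or its consistency term, so that component is omitted; the names `feats` (per-frame feature matrix) and `picks` (indices of selected frames) are hypothetical and not taken from the paper.

```python
import numpy as np

def swish(x):
    """Swish activation: x * sigmoid(x)."""
    return x / (1.0 + np.exp(-x))

def diversity_reward(feats, picks):
    """Mean pairwise dissimilarity (1 - cosine similarity) among selected frames."""
    sel = feats[picks]
    sel = sel / np.linalg.norm(sel, axis=1, keepdims=True)
    k = len(picks)
    if k < 2:
        return 0.0
    sim = sel @ sel.T                       # diagonal terms contribute 1 - 1 = 0
    return float(np.sum(1.0 - sim) / (k * (k - 1)))

def representativeness_reward(feats, picks):
    """exp(-mean distance from each frame to its nearest selected frame)."""
    sel = feats[picks]
    dists = np.linalg.norm(feats[:, None, :] - sel[None, :, :], axis=-1)  # (T, k)
    return float(np.exp(-dists.min(axis=1).mean()))

# Toy usage: 100 frames with 64-dim features; every 10th frame picked as the summary.
feats = np.random.randn(100, 64)
picks = np.arange(0, 100, 10)
reward = diversity_reward(feats, picks) + representativeness_reward(feats, picks)
print(round(reward, 4), swish(np.array([-1.0, 0.0, 1.0])))
```

In this sketch the overall reward is a plain sum of the two terms; how the paper weights them, and how the consistency term enters R(S), is not specified in the abstract.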
Funder
National Natural Science Foundation of China
Guangxi Natural Science Foundation
Guangxi Science and Technology Major Project
Subject
Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry
Cited by
4 articles.