Deep Reinforcement Learning for Unsupervised Video Summarization With Diversity-Representativeness Reward-Reference-Cited by-同舟云学术

Deep Reinforcement Learning for Unsupervised Video Summarization With Diversity-Representativeness Reward

Published:2018-04-27 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Zhou Kaiyang,Qiao Yu,Xiang Tao

Abstract

Video summarization aims to facilitate large-scale video browsing by producing short, concise summaries that are diverse and representative of original videos. In this paper, we formulate video summarization as a sequential decision-making process and develop a deep summarization network (DSN) to summarize videos. DSN predicts for each video frame a probability, which indicates how likely a frame is selected, and then takes actions based on the probability distributions to select frames, forming video summaries. To train our DSN, we propose an end-to-end, reinforcement learning-based framework, where we design a novel reward function that jointly accounts for diversity and representativeness of generated summaries and does not rely on labels or user interactions at all. During training, the reward function judges how diverse and representative the generated summaries are, while DSN strives for earning higher rewards by learning to produce more diverse and more representative summaries. Since labels are not required, our method can be fully unsupervised. Extensive experiments on two benchmark datasets show that our unsupervised method not only outperforms other state-of-the-art unsupervised methods, but also is comparable to or even superior than most of published supervised approaches.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 128 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A two-stream sign language recognition network based on keyframe extraction method;Expert Systems with Applications;2024-11

2. Attention-guided multi-granularity fusion model for video summarization;Expert Systems with Applications;2024-09

3. Query-attentive video summarization: a comprehensive review;Multimedia Tools and Applications;2024-08-06

4. GAT-Based Bi-CARU with Adaptive Feature-Based Transformation for Video Summarisation;Technologies;2024-08-05

5. Multi-Reference Evaluation of Dynamic Video Summaries Using Granule-Aware F-Measure;IEEE Transactions on Emerging Topics in Computational Intelligence;2024-08