Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization-Reference-Cited by-同舟云学术

Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization

Published:2019-07-17 Issue: Volume:33 Page:9143-9150
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Yuan Li,Tay Francis EH,Li Ping,Zhou Li,Feng Jiashi

Abstract

In this paper, we present a novel unsupervised video summarization model that requires no manual annotation. The proposed model termed Cycle-SUM adopts a new cycleconsistent adversarial LSTM architecture that can effectively maximize the information preserving and compactness of the summary video. It consists of a frame selector and a cycle-consistent learning based evaluator. The selector is a bi-direction LSTM network that learns video representations that embed the long-range relationships among video frames. The evaluator defines a learnable information preserving metric between original video and summary video and “supervises” the selector to identify the most informative frames to form the summary video. In particular, the evaluator is composed of two generative adversarial networks (GANs), in which the forward GAN is learned to reconstruct original video from summary video while the backward GAN learns to invert the processing. The consistency between the output of such cycle learning is adopted as the information preserving metric for video summarization. We demonstrate the close relation between mutual information maximization and such cycle learning procedure. Experiments on two video summarization benchmark datasets validate the state-of-theart performance and superiority of the Cycle-SUM model over previous baselines.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 58 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-Reference Evaluation of Dynamic Video Summaries Using Granule-Aware F-Measure;IEEE Transactions on Emerging Topics in Computational Intelligence;2024-08

2. Simulating urban expansion with interpretable cycle recurrent neural networks;GIScience & Remote Sensing;2024-06-03

3. Unsupervised video summarization with adversarial graph-based attention network;Journal of Visual Communication and Image Representation;2024-06

4. Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning;IEEE Transactions on Image Processing;2024

5. Efficient Video Summarization with Hydra Attentive Vision Transformer;2023 International Conference on Frontiers of Information Technology (FIT);2023-12-11