Video Summarization via Semantic Attended Networks-Reference-Cited by-同舟云学术

Video Summarization via Semantic Attended Networks

Published:2018-04-25 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Wei Huawei,Ni Bingbing,Yan Yichao,Yu Huanyu,Yang Xiaokang,Yao Chen

Abstract

The goal of video summarization is to distill a raw video into a more compact form without losing much semantic information. However, previous methods mainly consider the diversity and representation interestingness of the obtained summary, and they seldom pay sufficient attention to semantic information of resulting frame set, especially the long temporal range semantics. To explicitly address this issue, we propose a novel technique which is able to extract the most semantically relevant video segments (i.e., valid for a long term temporal duration) and assemble them into an informative summary. To this end, we develop a semantic attended video summarization network (SASUM) which consists of a frame selector and video descriptor to select an appropriate number of video shots by minimizing the distance between the generated description sentence of the summarized video and the human annotated text of the original video. Extensive experiments show that our method achieves a superior performance gain over previous methods on two benchmark datasets.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 36 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Attention-guided multi-granularity fusion model for video summarization;Expert Systems with Applications;2024-09

2. Query-attentive video summarization: a comprehensive review;Multimedia Tools and Applications;2024-08-06

3. GAT-Based Bi-CARU with Adaptive Feature-Based Transformation for Video Summarisation;Technologies;2024-08-05

4. Advancing Video Summarization Using Language-Based Attention Transformer;2024 International Conference on Signal Processing, Computation, Electronics, Power and Telecommunication (IConSCEPT);2024-07-04

5. Video summarization via knowledge-aware multimodal deep networks;Knowledge-Based Systems;2024-06