Automatic Story Segmentation for TV News Video Using Multiple Modalities-Reference-Cited by-同舟云学术

Automatic Story Segmentation for TV News Video Using Multiple Modalities

Published:2012 Issue: Volume:2012 Page:1-11
ISSN:1687-7578
Container-title:International Journal of Digital Multimedia Broadcasting
language:en
Short-container-title:International Journal of Digital Multimedia Broadcasting

Author:

Dumont Émilie¹,Quénot Georges¹

Affiliation:

1. UJF-Grenoble 1/UPMF-Grenoble 2/Grenoble INP, CNRS, LIG UMR 5217, 38041 Grenoble, France

Abstract

While video content is often stored in rather large files or broadcasted in continuous streams, users are often interested in retrieving only a particular passage on a topic of interest to them. It is, therefore, necessary to split video documents or streams into shorter segments corresponding to appropriate retrieval units. We propose here a method for the automatic segmentation of TV news videos into stories. A-multiple-descriptor based segmentation approach is proposed. The selected multimodal features are complementary and give good insights about story boundaries. Once extracted, these features are expanded with a local temporal context and combined by an early fusion process. The story boundaries are then predicted using machine learning techniques. We investigate the system by experiments conducted using TRECVID 2003 data and protocol of the story boundary detection task, and we show that the proposed approach outperforms the state-of-the-art methods while requiring a very small amount of manual annotation.

Funder

OSEO, French state agency for innovation

Publisher

Hindawi Limited

Subject

Electrical and Electronic Engineering,Media Technology,Communication

Link

http://downloads.hindawi.com/journals/ijdmb/2012/732514.pdf

Reference7 articles.

1. The ARGOS campaign: Evaluation of video analysis and indexing tools

2. Real time video scene detection and classification

3. Neural network-based face detection

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comparing neural sentence encoders for topic segmentation across domains: not your typical text similarity task;PeerJ Computer Science;2023-11-03

2. Exploring Pre-Trained Neural Audio Representations for Audio Topic Segmentation;2023 IEEE International Conference on Multimedia and Expo (ICME);2023-07

3. A Systematic Literature Review on Multimodal Machine Learning: Applications, Challenges, Gaps and Future Directions;IEEE Access;2023

4. Television Programs Classification via Deep Learning Approach Using SSMI-CNN;Applied Intelligence and Informatics;2022

5. Unsupervised story segmentation and indexing of broadcast news video;Multimedia Tools and Applications;2021-09-16