Near-lossless semantic video summarization and its applications to video analysis-Reference-Cited by-同舟云学术

Near-lossless semantic video summarization and its applications to video analysis

Published:2013-06 Issue:3 Volume:9 Page:1-23
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Mei Tao¹,Tang Lin-Xie²,Tang Jinhui³,Hua Xian-Sheng⁴

Affiliation:

1. Microsoft Research Asia, China

2. University of Science and Technology of China, China

3. Nanjing University of Science and Technology, China

4. Microsoft, USA

Abstract

The ever increasing volume of video content on the Web has created profound challenges for developing efficient indexing and search techniques to manage video data. Conventional techniques such as video compression and summarization strive for the two commonly conflicting goals of low storage and high visual and semantic fidelity. With the goal of balancing both video compression and summarization, this article presents a novel approach, called Near-Lossless Semantic Summarization (NLSS), to summarize a video stream with the least high-level semantic information loss by using an extremely small piece of metadata. The summary consists of compressed image and audio streams, as well as the metadata for temporal structure and motion information. Although at a very low compression rate (around 1/40 of H.264 baseline, where traditional compression techniques can hardly preserve an acceptable visual fidelity), the proposed NLSS still can be applied to many video-oriented tasks, such as visualization, indexing and browsing, duplicate detection, concept detection, and so on. We evaluate the NLSS on TRECVID and other video collections, and demonstrate that it is a powerful tool for significantly reducing storage consumption, while keeping high-level semantic fidelity.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/2487268.2487269

Reference53 articles.

1. Content-driven adaptation of on-line video

2. Bing. 2013. http://www.bing.com/?scope=video/. Bing. 2013. http://www.bing.com/?scope=video/.

3. An interactive comic book presentation for exploring video

4. A unified approach to shot change detection and camera motion characterization

Cited by 27 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Automatic summarization of endoscopic skull base surgical videos through object detection and hidden Markov modeling;Computerized Medical Imaging and Graphics;2023-09

2. Towards machine vision-based video analysis in smart cities: a survey, framework, applications and open issues;Multimedia Tools and Applications;2023-08-09

3. HiEve: A Large-Scale Benchmark for Human-Centric Video Analysis in Complex Events;International Journal of Computer Vision;2023-07-10

4. Data-driven enabled approaches for criteria-based video summarization: a comprehensive survey, taxonomy, and future directions;Multimedia Tools and Applications;2023-03-02

5. Progressive Localization Networks for Language-Based Moment Localization;ACM Transactions on Multimedia Computing, Communications, and Applications;2023-02-06