A stereoscopic video conversion scheme based on spatio-temporal analysis of MPEG videos-Reference-Cited by-同舟云学术

A stereoscopic video conversion scheme based on spatio-temporal analysis of MPEG videos

Published:2012-11-12 Issue:1 Volume:2012 Page:
ISSN:1687-6180
Container-title:EURASIP Journal on Advances in Signal Processing
language:en
Short-container-title:EURASIP J. Adv. Signal Process.

Author:

Lin Guo-Shiang,Huang Hsiang-Yun,Chen Wei-Chih,Yeh Cheng-Ying,Liu Kai-Che,Lie Wen-Nung

Abstract

Abstract In this article, an automatic stereoscopic video conversion scheme which accepts MPEG-encoded videos as input is proposed. Our scheme is depth-based, relying on spatio-temporal analysis of the decoded video data to yield depth perception cues, such as temporal motion and spatial contrast, which reflect the relative depths between the foreground and the background areas. Our scheme is shot-adaptive, demanding that shot change detection and shot classification be performed for tuning of algorithm or parameters that are used for depth cue combination. The above-mentioned depth estimation is initially block-based, followed by a locally adaptive joint trilateral upsampling algorithm to reduce the computing load significantly. A recursive temporal filter is used to reduce the possible depth fluctuations (and also artifacts in the synthesized images) resulting from wrong depth estimations. The traditional Depth-Image-Based-Rendering algorithm is used to synthesize the left- and right-view frames for 3D display. Subjective tests show that videos converted by our scheme provide comparable perceived depth and visual quality with those converted from the depth data calculated by stereo vision techniques. Also, our scheme is shown to outperform the well-known TriDef software in terms of human’s perceived 3D depth. Based on the implementation by using “OpenMP” parallel programming model, our scheme is capable of executing in real-time on a multi-core CPU platform.

Publisher

Springer Science and Business Media LLC

Link

http://link.springer.com/content/pdf/10.1186/1687-6180-2012-237.pdf

Reference35 articles.

1. Cheng CM, Lin SJ, Lai SH: Spatio-temporally consistent novel view synthesis algorithm from video-plus depth sequences for autostereoscopic displays. IEEE Trans. Broadcast. 2011, 57(2):523-532.

2. Quan HT, Barkowsky M, Callet PL: The importance of visual attention in improving the 3D-TV viewing experience: overview and new perspectives. IEEE Trans. Broadcast. 2011, 57(2):421-431.

3. Lin GS, Yeh CY, Chen WC, Lie WN: A 2D to 3D conversion scheme based on depth cues analysis for MPEG videos. In Proceedings of the IEEE International Conference on Multimedia and Expo. Singapore; 2010:1141-1145.

4. Zhang L, Vazquez C, Knorr A: 3D-TV content creation: automatic 2D-to-3D video conversion. IEEE Trans. Broadcast. 2011, 57(2):372-383.

5. Wang HM, Chen YH, Yang JF: A novel matching frame selection method for stereoscopic video generation. In Proceedings of the IEEE Int'l Conf. on Multimedia and Expo. New York, USA; 2009:1174-1177.

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Semi-automatic 2D-to-3D video conversion based on background sprite generation;Journal of Visual Communication and Image Representation;2020-07

2. Key-Frame-Based Background Sprite Generation for Hole Filling in Depth Image-Based Rendering;IEEE Transactions on Multimedia;2018-05

3. Key-frame-based depth propagation for semi-automatic stereoscopic video conversion;Journal of Visual Communication and Image Representation;2017-02

4. Monocular vision-based depth map extraction method for 2D to 3D video conversion;EURASIP Journal on Image and Video Processing;2016-06-03

5. Video quality enhancement based on visual attention model and multi-level exposure correction;Multimedia Tools and Applications;2015-08-08