Arbitrary Timestep Video Frame Interpolation with Time-Dependent Decoding-Reference-Cited by-同舟云学术

Arbitrary Timestep Video Frame Interpolation with Time-Dependent Decoding

Published:2024-01-17 Issue:2 Volume:12 Page:303
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Zhang Haokai¹,Ren Dongwei¹^ORCID,Yan Zifei¹,Zuo Wangmeng¹

Affiliation:

1. Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China

Abstract

Given an observed low frame rate video, video frame interpolation (VFI) aims to generate a high frame rate video, which has smooth video frames with higher frames per second (FPS). Most existing VFI methods often focus on generating one frame at a specific timestep, e.g., 0.5, between every two frames, thus lacking the flexibility to increase the video’s FPS by an arbitrary scale, e.g., 3. To better address this issue, in this paper, we propose an arbitrary timestep video frame interpolation (ATVFI) network with time-dependent decoding. Generally, the proposed ATVFI is an encoder–decoder architecture, where the interpolation timestep is an extra input added to the decoder network; this enables ATVFI to interpolate frames at arbitrary timesteps between input frames and to increase the video’s FPS at any given scale. Moreover, we propose a data augmentation method, i.e., multi-width window sampling, where video frames can be split into training samples with multiple window widths, to better leverage training frames for arbitrary timestep interpolation. Extensive experiments were conducted to demonstrate the superiority of our model over existing baseline models on several testing datasets. Specifically, our model trained on the GoPro training set achieved 32.50 on the PSNR metric on the commonly used Vimeo90k testing set.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-7390/12/2/303/pdf

Reference46 articles.

1. Niklaus, S., Mai, L., and Liu, F. (2017, January 22–29). Video Frame Interpolation via Adaptive Separable Convolution. Proceedings of the IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy.

2. Niklaus, S., and Liu, F. (2018, January 18–22). Context-Aware Synthesis for Video Frame Interpolation. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA.

3. Gui, S., Wang, C., Chen, Q., and Tao, D. (2020, January 13–19). FeatureFlow: Robust Video Interpolation via Structure-to-Texture Generation. Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA.

4. Reda, F.A., Kontkanen, J., Tabellion, E., Sun, D., Pantofaru, C., and Curless, B. (2022). FILM: Frame Interpolation for Large Motion. arXiv.

5. Peleg, T., Szekely, P., Sabo, D., and Sendik, O. (2019, January 16–20). IM-Net for High Resolution Video Frame Interpolation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA.