1. ViViT: A Video Vision Transformer
2. Depth-Aware Video Frame Interpolation
3. Wenbo Bao, Wei-Sheng Lai, Xiaoyun Zhang, Zhiyong Gao, and Ming-Hsuan Yang. 2019b. MEMC-Net: Motion estimation and motion compensation driven neural network for video interpolation and enhancement. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 43, 3 (2019), 933--948.
4. Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, et al. 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).
5. Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation