Adapting Single-Image Super-Resolution Models to Video Super-Resolution: A Plug-and-Play Approach
Author:
Wang Wenhao1ORCID, Liu Zhenbing1, Lu Haoxiang1, Lan Rushi1, Huang Yingxin1
Affiliation:
1. School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China
Abstract
The quality of videos varies due to the different capabilities of sensors. Video super-resolution (VSR) is a technology that improves the quality of captured video. However, the development of a VSR model is very costly. In this paper, we present a novel approach for adapting single-image super-resolution (SISR) models to the VSR task. To achieve this, we first summarize a common architecture of SISR models and perform a formal analysis of adaptation. Then, we propose an adaptation method that incorporates a plug-and-play temporal feature extraction module into existing SISR models. The proposed temporal feature extraction module consists of three submodules: offset estimation, spatial aggregation, and temporal aggregation. In the spatial aggregation submodule, the features obtained from the SISR model are aligned to the center frame based on the offset estimation results. The aligned features are fused in the temporal aggregation submodule. Finally, the fused temporal feature is fed to the SISR model for reconstruction. To evaluate the effectiveness of our method, we adapt five representative SISR models and evaluate these models on two popular benchmarks. The experiment results show the proposed method is effective on different SISR models. In particular, on the Vid4 benchmark, the VSR-adapted models achieve at least 1.26 dB and 0.067 improvement over the original SISR models in terms of PSNR and SSIM metrics, respectively. Additionally, these VSR-adapted models achieve better performance than the state-of-the-art VSR models.
Funder
National Natural Science Foundation of China Guangxi Science Fund for Distinguished Young Scholars Guangxi Key Research and Development Program Innovation Project of Guangxi Graduate Education
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference51 articles.
1. Yang, C., Huang, Z., and Wang, N. (2022, January 18–24). QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA. 2. Shermeyer, J., and Etten, A.V. (2019, January 16–20). The Effects of Super-Resolution on Object Detection Performance in Satellite Imagery. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2019, Computer Vision Foundation/IEEE, Long Beach, CA, USA. 3. Dong, H., Xie, K., Xie, A., Wen, C., He, J., Zhang, W., Yi, D., and Yang, S. (2023). Detection of Occluded Small Commodities Based on Feature Enhancement under Super-Resolution. Sensors, 23. 4. Yuan, X., Fu, D., and Han, S. (2023). LRF-SRNet: Large-Scale Super-Resolution Network for Estimating Aircraft Pose on the Airport Surface. Sensors, 23. 5. Highly accurate protein structure prediction with AlphaFold;Jumper;Nature,2021
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|