A Multi-Scale Deep Back-Projection Backbone for Face Super-Resolution with Diffusion Models
-
Published:2023-07-12
Issue:14
Volume:13
Page:8110
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Gao Juhao1, Tang Ni1ORCID, Zhang Dongxiao1ORCID
Affiliation:
1. School of Science, Jimei University, Xiamen 361021, China
Abstract
Face verification and recognition are important tasks that have made great progress in recent years. However, recognizing low-resolution faces from small images is still a difficult problem. In this paper, we advocate using diffusion models (DMs) to enhance face resolution and improve their quality for various downstream applications. Most existing DMs for super-resolution use U-Net as their backbone network, which only exploits multi-scale features along the spatial dimension. These approaches result in a slow convergence of corresponding DMs and the inability to capture complex details and fine textures. To address this issue, we propose a novel conditional generative model based on DMs called BPSR3, which replaces the U-Net in super-resolution via repeated refinement (SR3) with a multi-scale deep back-projection network structure. BPSR3 can extract richer features not only in depth but also in breadth. This helps to effectively refine the image quality at different scales. The experimental results on facial datasets show that BPSR3 significantly improved both convergence speed and reconstruction performance. BPSR3 has about 1/4 of the parameters of SR3 but achieves a 50.1% improvement in PSNR, a 19.8% improvement in SSIM, and a 15.4% reduction in FID. Our contribution lies in achieving less time and space consumption and better reconstruction results. In addition, we propose an idea of enhancing the performance of DMs by replacing the U-Net with a better network.
Funder
the National Fund Cultivation Program of Jimei University the National Natural Science Foundation of China the Natural Science Foundation of Fujian Province
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference43 articles.
1. Shalev-Shwartz, S., Shamir, O., and Shammah, S. (2017, January 6–11). Failures of Gradient-Based Deep Learning. Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia. 2. Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (2014). Proceedings of the Computer Vision—ECCV 2014, Springer International Publishing. 3. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015). Going Deeper with Convolutions. arXiv. 4. Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (2014). Proceedings of the Computer Vision—ECCV 2014, Springer International Publishing. 5. Kim, J., Lee, J.K., and Lee, K.M. (2016). Accurate Image Super-Resolution Using Very Deep Convolutional Networks. arXiv.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|