Super-resolution reconstruction of single image for latent features

Published: 2024-05-24
ISSN:2096-0433
Container-title:Computational Visual Media
language:en
Short-container-title:Comp. Visual Media

Author:

Wang Xin,Yan Jing-Ke,Cai Jing-Ye,Deng Jian-Hua,Qin Qin,Cheng Yao

Abstract

Single-image super-resolution (SISR) typically focuses on restoring various degraded low-resolution (LR) images to a single high-resolution (HR) image. However, during SISR tasks, it is often challenging for models to simultaneously maintain high quality and rapid sampling while preserving diversity in details and texture features. This challenge can lead to issues such as model collapse, lack of rich details and texture features in the reconstructed HR images, and excessive time consumption for model sampling. To address these problems, this paper proposes a Latent Feature-oriented Diffusion Probability Model (LDDPM). First, we designed a conditional encoder capable of effectively encoding LR images, reducing the solution space for model image reconstruction and thereby improving the quality of the reconstructed images. We then employed a normalized flow and multimodal adversarial training, learning from complex multimodal distributions, to model the denoising distribution. Doing so boosts the generative modeling capabilities within a minimal number of sampling steps. Experimental comparisons of our proposed model with existing SISR methods on mainstream datasets demonstrate that our model reconstructs more realistic HR images and achieves better performance on multiple evaluation metrics, providing a fresh perspective for tackling SISR tasks.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s41095-023-0387-8.pdf

References: 64 articles.

1. Cheng, L.; Fang, P.; Liang, Y.; Zhang, L.; Shen, C.; Wang, H. TSGB: Target-selective gradient backprop for probing CNN visual saliency. IEEE Transactions on Image Processing Vol. 31, 2529–2540, 2022.

2. Jiang, D.; Jin, Y.; Zhang, F. L.; Zhu, Z.; Zhang, Y.; Tong, R.; Tang, M. Sphere face model: A 3D morphable model with hypersphere manifold latent space using joint 2D/3D training. Computational Visual Media Vol. 9, No. 2, 279–296, 2023.

3. Yan, J.; Wang, Q.; Cheng, Y.; Su, Z.; Zhang, F.; Zhong, M.; Liu, L.; Jin, B.; Zhang, W. Optimized single-image super-resolution reconstruction: A multimodal approach based on reversible guidance and cyclical knowledge distillation. Engineering Applications of Artificial Intelligence Vol. 133, 108496, 2024.

4. Wang, M.; Xu, Z.; Liu, X.; Xiong, J.; Xie, W. Perceptually quasi-lossless compression of screen content data via visibility modeling and deep forecasting. IEEE Transactions on Industrial Informatics Vol. 18, No. 10, 6865–6875, 2022.

5. Chen, S.; Wang, J.; Pan, W.; Gao, S.; Wang, M.; Lu, X. Towards uniform point distribution in feature-preserving point cloud filtering. Computational Visual Media Vol. 9, No. 2, 249–263, 2023.

Cited by 3 articles.

1. Optimized single-image super-resolution reconstruction: A multimodal approach based on reversible guidance and cyclical knowledge distillation;Engineering Applications of Artificial Intelligence;2024-07

2. Improving image super-resolution with structured knowledge distillation-based multimodal denoising diffusion probabilistic model;Journal of Electronic Imaging;2024-05-02

3. Single image super-resolution with denoising diffusion GANS;Scientific Reports;2024-02-21