Author:
Alireza Momenzadeh, Enzo Baccarelli, Michele Scarpiniti, Sima Sarv Ahrabi
Abstract
In this paper, we design and evaluate the performance of the Multi-resolution Twinned Residual Auto-Encoders (MR-TRAE) model, a deep learning (DL)-based architecture specifically designed to produce multi-resolution super-resolved images from low-resolution (LR) inputs at various scaling factors. For this purpose, we build on the recently introduced Twinned Residual Auto-Encoders (TRAE) paradigm for single-image super-resolution (SISR) and extend it to the multi-resolution (MR) domain. The main contributions of this work are (i) the architecture of the MR-TRAE model, which uses cascaded trainable up-sampling modules to progressively increase the spatial resolution of LR input images at multiple scaling factors; (ii) a novel loss function designed for the joint and semi-blind training of all MR-TRAE model components; and (iii) a comprehensive analysis of the MR-TRAE trade-off between model complexity and performance. Furthermore, we thoroughly explore the connections between the MR-TRAE architecture and broader cognitive paradigms, including knowledge distillation, the teacher-student learning model, and hierarchical cognition. Performance evaluations of the MR-TRAE, benchmarked against state-of-the-art models (such as U-Net, generative adversarial network (GAN)-based, and single-resolution baselines), were conducted on publicly available datasets consisting of LR computed tomography (CT) scans of patients with COVID-19. Our tests, which explored multi-resolution super-resolution at scaling factors $\times(2, 4, 8)$, showed a significant finding: the MR-TRAE model can reduce training times by up to $60\%$ compared to those of the baselines, without a noticeable impact on the achieved performance.
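Below is a minimal sketch, assuming PyTorch, of the cascaded trainable up-sampling idea named in contribution (i): a chain of ×2 stages whose intermediate outputs yield the ×2, ×4, and ×8 super-resolved images from a single LR input. The class names, layer choices (PixelShuffle, convolution widths), and single-channel CT input are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn


class UpsampleStage(nn.Module):
    """One trainable x2 up-sampling stage (hypothetical layer choices)."""

    def __init__(self, channels: int):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(channels, channels * 4, kernel_size=3, padding=1),
            nn.PixelShuffle(2),          # doubles the spatial resolution
            nn.ReLU(inplace=True),
        )
        self.to_image = nn.Conv2d(channels, 1, kernel_size=3, padding=1)

    def forward(self, x):
        x = self.block(x)
        # Return features for the next stage and this stage's SR image.
        return x, self.to_image(x)


class CascadedMultiResolutionSR(nn.Module):
    """Cascade of x2 stages producing x2, x4, and x8 outputs from one LR input."""

    def __init__(self, channels: int = 64, num_stages: int = 3):
        super().__init__()
        self.head = nn.Conv2d(1, channels, kernel_size=3, padding=1)
        self.stages = nn.ModuleList(UpsampleStage(channels) for _ in range(num_stages))

    def forward(self, lr):
        feats = self.head(lr)
        outputs = []
        for stage in self.stages:
            feats, sr = stage(feats)
            outputs.append(sr)           # outputs[i] has scaling factor 2**(i+1)
        return outputs


# Example: a 64x64 LR slice yields 128x128, 256x256, and 512x512 outputs.
lr = torch.randn(1, 1, 64, 64)
for sr in CascadedMultiResolutionSR()(lr):
    print(sr.shape)
```

Because every stage emits its own super-resolved image, a joint loss can supervise all scaling factors at once, which is the setting the MR-TRAE loss function addresses.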
Funder
Università degli Studi di Roma La Sapienza
Publisher
Springer Science and Business Media LLC