ESTUGAN: Enhanced Swin Transformer with U-Net Discriminator for Remote Sensing Image Super-Resolution-Reference-Cited by-同舟云学术

ESTUGAN: Enhanced Swin Transformer with U-Net Discriminator for Remote Sensing Image Super-Resolution

Published:2023-10-13 Issue:20 Volume:12 Page:4235
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Yu Chunhe¹,Hong Lingyue¹^ORCID,Pan Tianpeng¹^ORCID,Li Yufeng¹,Li Tingting¹

Affiliation:

1. School of Electronical and Information Engineering, Shenyang Aerospace University, Shenyang 110136, China

Abstract

Remote sensing image super-resolution (SR) is a practical research topic with broad applications. However, the mainstream algorithms for this task suffer from limitations. CNN-based algorithms face difficulties in modeling long-term dependencies, while generative adversarial networks (GANs) are prone to producing artifacts, making it difficult to reconstruct high-quality, detailed images. To address these challenges, we propose ESTUGAN for remote sensing image SR. On the one hand, ESTUGAN adopts the Swin Transformer as the network backbone and upgrades it to fully mobilize input information for global interaction, achieving impressive performance with fewer parameters. On the other hand, we employ a U-Net discriminator with the region-aware learning strategy for assisted supervision. The U-shaped design enables us to obtain structural information at each hierarchy and provides dense pixel-by-pixel feedback on the predicted images. Combined with the region-aware learning strategy, our U-Net discriminator can perform adversarial learning only for texture-rich regions, effectively suppressing artifacts. To achieve flexible supervision for the estimation, we employ the Best-buddy loss. And we also add the Back-projection loss as a constraint for the faithful reconstruction of the high-resolution image distribution. Extensive experiments demonstrate the superior perceptual quality and reliability of our proposed ESTUGAN in reconstructing remote sensing images.

Funder

Liaoning Provincial Science and Technology Department

Shenyang Municipal Natural Science Foundation

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/20/4235/pdf

Reference65 articles.

1. Dong, C., Loy, C.C., He, K., and Tang, X. (2014, January 6–12). Learning a Deep Convolutional Network for Image Super-Resolution. Proceedings of the Computer Vision—ECCV 2014, Zurich, Switzerland.

2. Kim, J., Lee, J.K., and Lee, K.M. (July, January 26). Accurate Image Super-Resolution Using Very Deep Convolutional Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

3. Kim, J., Lee, J.K., and Lee, K.M. (July, January 26). Deeply-Recursive Convolutional Network for Image Super-Resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

4. Lai, W.S., Huang, J.B., Ahuja, N., and Yang, M.H. (July, January 26). Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.

5. Zhang, Y., Li, K., Li, K., Wang, L., Zhong, B., and Fu, Y. (2018, January 8–14). Image Super-Resolution Using Very Deep Residual Channel Attention Networks. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhanced Remote Sensing Image Compression Method Using Large Network with Sparse Extracting Strategy;Electronics;2024-07-08

2. Learning the Frequency Domain Aliasing for Real-World Super-Resolution;Electronics;2024-01-05