PSRGAN: Perception-Design-Oriented Image Super Resolution Generative Adversarial Network
-
Published:2023-10-27
Issue:21
Volume:12
Page:4420
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Wu Tao1ORCID, Xiong Shuo2ORCID, Liu Hui1ORCID, Zhao Yangyang3, Tuo Haoran1, Li Yi1, Zhang Jiaxin1, Liu Huaizheng1
Affiliation:
1. National Model Software Institute, Huazhong University of Science and Technology, Wuhan 430074, China 2. PSS Lab of Big Data and National Communication Strategy, MOE, Huazhong University of Science and Technology, Wuhan 430074, China 3. Hytera Communication Co., Ltd., Shenzhen 518057, China
Abstract
Among recent state-of-the-art realistic image super-resolution (SR) intelligent algorithms, generative adversarial networks (GANs) have achieved impressive visual performance. However, there has been the problem of unsatisfactory perception of super-scored pictures with unpleasant artifacts. To address this issue and further improve visual quality, we proposed a perception-design-oriented PSRGAN with double perception turbos for real-world SR. The first-perception turbo in the generator network has a three-level perception structure with different convolution kernel sizes, which can extract multi-scale features from four 14 size sub-images sliced by original LR image. The slice operation expands adversarial samples to four and could alleviate artifacts during GAN training. The extracted features will be eventually concatenated in later 3 × 2 upsampling processes through pixel shuffle to restore SR image with diversified delicate textures. The second-perception turbo in discriminators has cascaded perception turbo blocks (PTBs), which could further perceive multi-scale features at various spatial relationships and promote the generator to restore subtle textures driven by GAN. Compared with recent SR methods (BSRGAN, real-ESRGAN, PDM_SR, SwinIR, LDL, etc.), we conducted an extensive test with a ×4 upscaling factor on various datasets (OST300, 2020track1, RealSR-Canon, RealSR-Nikon, etc.). We conducted a series of experiments that show that our proposed PSRGAN based on generative adversarial networks outperforms current state-of-the-art intelligent algorithms on several evaluation metrics, including NIQE, NRQM and PI. In terms of visualization, PSRGAN generates finer and more natural textures while suppressing unpleasant artifacts and achieves significant improvements in perceptual quality.
Funder
Tencent Technology (Shenzhen) Co., Ltd.
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference61 articles.
1. Lanczos filtering in one and two dimensions;Duchon;J. Appl. Meteorol. Climatol.,1979 2. An edge-guided image interpolation algorithm via directional filtering and data fusion;Zhang;IEEE Trans. Image Process.,2006 3. Wu, Y., Ding, H., Gong, M., Qin, A., Ma, W., Miao, Q., and Tan, K.C. (2022). Evolutionary multiform optimization with two-stage bidirectional knowledge transfer strategy for point cloud registration. IEEE Trans. Evol. Comput. 4. Wu, Y., Zhang, Y., Ma, W., Gong, M., Fan, X., Zhang, M., Qin, A., and Miao, Q. (2023). Rornet: Partial-to-partial registration network with reliable overlapping representations. IEEE Trans. Neural Netw. Learn. Syst. 5. Softcuts: A soft edge smoothness prior for color image super-resolution;Dai;IEEE Trans. Image Process.,2009
|
|