Design of GPU Network-on-Chip for Real-Time Video Super-Resolution Reconstruction-Reference-Cited by-同舟云学术

Design of GPU Network-on-Chip for Real-Time Video Super-Resolution Reconstruction

Published:2023-05-16 Issue:5 Volume:14 Page:1055
ISSN:2072-666X
Container-title:Micromachines
language:en
Short-container-title:Micromachines

Author:

Peng Zhiyong¹,Du Jiang¹,Qiao Yulong²

Affiliation:

1. School of Optoelectronic Engineering, Guilin University of Electronic Technology, Guilin 541004, China

2. School of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China

Abstract

Deep learning has a better output quality compared with traditional algorithms for video super-resolution (SR), but the network model needs large resources and has poor real-time performance. This paper focuses on solving the speed problem of SR; it achieves real-time SR by the collaborative design of a deep learning video SR algorithm and GPU parallel acceleration. An algorithm combining deep learning networks with a lookup table (LUT) is proposed for the video SR, which ensures both the SR effect and ease of GPU parallel acceleration. The computational efficiency of the GPU network-on-chip algorithm is improved to ensure real-time performance by three major GPU optimization strategies: storage access optimization, conditional branching function optimization, and threading optimization. Finally, the network-on-chip was implemented on a RTX 3090 GPU, and the validity of the algorithm was demonstrated through ablation experiments. In addition, SR performance is compared with existing classical algorithms based on standard datasets. The new algorithm was found to be more efficient than the SR-LUT algorithm. The average PSNR was 0.61 dB higher than the SR-LUT-V algorithm and 0.24 dB higher than the SR-LUT-S algorithm. At the same time, the speed of real video SR was tested. For a real video with a resolution of 540×540, the proposed GPU network-on-chip achieved a speed of 42 FPS. The new method is 9.1 times faster than the original SR-LUT-S fast method, which was directly imported into the GPU for processing.

Funder

Natural Science Foundation of Guangxi Province

Innovation Project of Guangxi Graduate Education

Graduate Education Innovation Program of Guilin University of Electronic Technology

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Mechanical Engineering,Control and Systems Engineering

Link

https://www.mdpi.com/2072-666X/14/5/1055/pdf

Reference33 articles.

1. Dong, C., Loy, C.C., He, K., and Tang, X. (2014, January 6–12). Learning a Deep Convolutional Network for Image Super-Resolution. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.

2. Dong, C., Loy, C.C., and Tang, X. (2016, January 8–16). Accelerating the Super-Resolution Convolutional Neural Network. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.

3. Lai, W., Huang, J., Ahuja, N., and Yang, M. (2017, January 21–26). Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.

4. Hui, Z., Gao, X., Yang, Y., and Wang, X. (2019, January 21–25). Lightweight Image Super-Resolution with Information Multi-distillation Network. Proceedings of the 27th ACM International Conference on Multimedia, Nice, France.

5. Li, Z., Yang, J., Liu, Z., Yang, X., Jeon, G., and Wu, W. (2019, January 15–20). Feedback Network for Image Super-Resolution. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.