CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising-Reference-Cited by-同舟云学术

CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising

Published:2023-02-25 Issue:3 Volume:19 Page:1-22
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Xu Kang¹^ORCID,Li Weixin²^ORCID,Wang Xia¹^ORCID,Hu Xiaoyan¹^ORCID,Yan Ke³^ORCID,Wang Xiaojie¹^ORCID,Dong Xuan¹^ORCID

Affiliation:

1. Beijing University of Posts and Telecommunications, Beijing, P. R. China

2. Beihang University, Beijing, P. R. China, and Zhongguancun Laboratory, Beijing, P. R. China

3. Alibaba DAMO Academy, Beijing, P. R. China

Abstract

Image denoising is a fundamental problem in computer vision and multimedia computation. Non-local filters are effective for image denoising. But existing deep learning methods that use non-local computation structures are mostly designed for high-level tasks, and global self-attention is usually adopted. For the task of image denoising, they have high computational complexity and have a lot of redundant computation of uncorrelated pixels. To solve this problem and combine the marvelous advantages of non-local filter and deep learning, we propose a Convolutional Unbiased Regional (CUR) transformer. Based on the prior that, for each pixel, its similar pixels are usually spatially close, our insights are that (1) we partition the image into non-overlapped windows and perform regional self-attention to reduce the search range of each pixel, and (2) we encourage pixels across different windows to communicate with each other. Based on our insights, the CUR transformer is cascaded by a series of convolutional regional self-attention (CRSA) blocks with U-style short connections. In each CRSA block, we use convolutional layers to extract the query, key, and value features, namely Q , K , and V , of the input feature. Then, we partition the Q , K , and V features into local non-overlapped windows and perform regional self-attention within each window to obtain the output feature of this CRSA block. Among different CRSA blocks, we perform the unbiased window partition by changing the partition positions of the windows. Experimental results show that the CUR transformer outperforms the state-of-the-art methods significantly on four low-level vision tasks, including real and synthetic image denoising, JPEG compression artifact reduction, and low-light image enhancement.

Funder

National Nature Science Foundation of China

CAAI-Huawei MindSpore Open Fund

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3566125

Reference69 articles.

1. Abdelrahman Abdelhamed, Stephen Lin, and Michael S. Brown. 2018. A high-quality denoising dataset for smartphone cameras. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1692–1700.

2. Andrew Adams, Natasha Gelfand, Jennifer Dolson, and Marc Levoy. 2009. Gaussian kd-trees for fast high-dimensional filtering. In ACM SIGGRAPH 2009 Papers. 1–12.

3. Vaswani Ashish, Ramachandran Prajit, Srinivas Aravind, Parmar Niki, Hechtman Blake, and Shlens Jonathon. 2021. Scaling local self-attention for parameter efficient visual backbones. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.

4. Efficient Nonlocal Means for Denoising of Textural Patterns

5. A Non-Local Algorithm for Image Denoising

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dual-branch network based on transformer for texture recognition;Digital Signal Processing;2024-10

2. Wavelet–pixel domain progressive fusion network for underwater image enhancement;Knowledge-Based Systems;2024-09

3. HCSAM-Net: multistage network with a hybrid of convolution and self-attention mechanism for low-light image enhancement;Multimedia Tools and Applications;2024-08-10

4. Efficient image denoising with heterogeneous kernel-based CNN;Neurocomputing;2024-08

5. Enhancing Object Detection in Distorted Environments: A Seamless Integration of Classical Image Processing with Deep Learning Models;2024 IEEE 12th International Symposium on Signal, Image, Video and Communications (ISIVC);2024-05-21