Affiliation:
1. Beijing University of Posts and Telecommunications, Beijing, P. R. China
2. Beihang University, Beijing, P. R. China, and Zhongguancun Laboratory, Beijing, P. R. China
3. Alibaba DAMO Academy, Beijing, P. R. China
Abstract
Image denoising is a fundamental problem in computer vision and multimedia computation. Non-local filters are effective for image denoising. But existing deep learning methods that use non-local computation structures are mostly designed for high-level tasks, and global self-attention is usually adopted. For the task of image denoising, they have high computational complexity and have a lot of redundant computation of uncorrelated pixels. To solve this problem and combine the marvelous advantages of non-local filter and deep learning, we propose a Convolutional Unbiased Regional (CUR) transformer. Based on the prior that, for each pixel, its similar pixels are usually spatially close, our insights are that (1) we partition the image into non-overlapped windows and perform regional self-attention to reduce the search range of each pixel, and (2) we encourage pixels across different windows to communicate with each other. Based on our insights, the CUR transformer is cascaded by a series of convolutional regional self-attention (CRSA) blocks with U-style short connections. In each CRSA block, we use convolutional layers to extract the query, key, and value features, namely
Q
,
K
, and
V
, of the input feature. Then, we partition the
Q
,
K
, and
V
features into local non-overlapped windows and perform regional self-attention within each window to obtain the output feature of this CRSA block. Among different CRSA blocks, we perform the unbiased window partition by changing the partition positions of the windows. Experimental results show that the CUR transformer outperforms the state-of-the-art methods significantly on four low-level vision tasks, including real and synthetic image denoising, JPEG compression artifact reduction, and low-light image enhancement.
Funder
National Nature Science Foundation of China
CAAI-Huawei MindSpore Open Fund
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture
Reference69 articles.
1. Abdelrahman Abdelhamed, Stephen Lin, and Michael S. Brown. 2018. A high-quality denoising dataset for smartphone cameras. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1692–1700.
2. Andrew Adams, Natasha Gelfand, Jennifer Dolson, and Marc Levoy. 2009. Gaussian kd-trees for fast high-dimensional filtering. In ACM SIGGRAPH 2009 Papers. 1–12.
3. Vaswani Ashish, Ramachandran Prajit, Srinivas Aravind, Parmar Niki, Hechtman Blake, and Shlens Jonathon. 2021. Scaling local self-attention for parameter efficient visual backbones. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
4. Efficient Nonlocal Means for Denoising of Textural Patterns
5. A Non-Local Algorithm for Image Denoising
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献