Affiliation:
1. School of Artificial Intelligence, Jilin University, Changchun, China
2. College of Software, Jilin University, Changchun, China
3. Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Abstract
The U‐Net structure is widely used for low‐light image/video enhancement. Without proper guidance from global information, however, the enhanced images suffer from large local noise and loss of detail. Attention mechanisms can focus on and exploit global information, but applying attention over entire images significantly increases the number of parameters and computations. We propose a Row–Column Separated Attention module (RCSA) inserted after an improved U‐Net. The input to the RCSA module is the mean and maximum of each row and column of the feature map, which lets global information guide local information with fewer parameters. To extend the method to low‐light video enhancement while maintaining temporal consistency, we propose two temporal loss functions. Extensive experiments on the LOL and MIT Adobe FiveK image datasets and the SDSD video dataset demonstrate the effectiveness of our approach.
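The core idea behind the RCSA input can be sketched as follows: instead of attending over all H×W spatial positions, the feature map is first reduced to per-row and per-column mean and maximum descriptors. The function name, tensor layout, and use of NumPy below are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def row_column_stats(feat):
    """Reduce a feature map of shape (C, H, W) to 1-D row/column descriptors.

    Returns row mean/max of shape (C, H) and column mean/max of shape (C, W).
    Attention over these descriptors scales with H + W rather than H * W,
    which is the source of the parameter and computation savings.
    """
    row_mean = feat.mean(axis=2)   # average over width  -> (C, H)
    row_max  = feat.max(axis=2)    # maximum over width  -> (C, H)
    col_mean = feat.mean(axis=1)   # average over height -> (C, W)
    col_max  = feat.max(axis=1)    # maximum over height -> (C, W)
    return row_mean, row_max, col_mean, col_max

# Example: an 8-channel, 32x48 feature map.
feat = np.random.rand(8, 32, 48).astype(np.float32)
rm, rx, cm, cx = row_column_stats(feat)
print(rm.shape, cm.shape)  # (8, 32) (8, 48)
```

In the full module these descriptors would feed an attention block whose output reweights the original feature map; this sketch covers only the row/column pooling step described in the abstract.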