Contrastive Multiscale Transformer for Image Dehazing-Reference-Cited by-同舟云学术

Contrastive Multiscale Transformer for Image Dehazing

Published:2024-03-22 Issue:7 Volume:24 Page:2041
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Chen Jiawei¹,Zhao Guanghui¹^ORCID

Affiliation:

1. School of Artificial Intelligence, Xidian University, Xi’an 710071, China

Abstract

Images obtained in an unfavorable environment may be affected by haze or fog, leading to fuzzy image details, low contrast, and loss of important information. Recently, significant progress has been achieved in the realm of image dehazing, largely due to the adoption of deep learning techniques. Owing to the lack of modules specifically designed to learn the unique characteristics of haze, existing deep neural network-based methods are impractical for processing images containing haze. In addition, most networks primarily focus on learning clear image information while disregarding potential features in hazy images. To address these limitations, we propose an innovative method called contrastive multiscale transformer for image dehazing (CMT-Net). This method uses the multiscale transformer to enable the network to learn global hazy features at multiple scales. Furthermore, we introduce feature combination attention and a haze-aware module to enhance the network’s ability to handle varying concentrations of haze by assigning more weight to regions containing haze. Finally, we design a multistage contrastive learning loss incorporating different positive and negative samples at various stages to guide the network’s learning process to restore real and non-hazy images. The experimental findings demonstrate that CMT-Net provides exceptional performance on established datasets and exhibits superior visual outcomes.

Funder

Key Research and Development Plan Projects of Shaanxi Province

Publisher

MDPI AG

Link

https://www.mdpi.com/1424-8220/24/7/2041/pdf

Reference57 articles.

1. Sakaridis, C., Dai, D., Hecker, S., and Van Gool, L. (2018, January 8–14). Model adaptation with synthetic and real data for semantic dense foggy scene understanding. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.

2. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., and Zagoruyko, S. (2020, January 23–28). End-to-end object detection with transformers. Proceedings of the European Conference on Computer Vision, Glasgow, UK.

3. Real time image and video deweathering: The future prospects and possibilities;Kumari;Optik,2016

4. Prakash, A., Chitta, K., and Geiger, A. (2021, January 25). Multi-modal fusion transformer for end-to-end autonomous driving. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.

5. McCartney, E.J. (1976). Optics of the Atmosphere: Scattering by Molecules and Particles, John Wiley and Sons, Inc. .

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A lightweight attention-based network for image dehazing;Signal, Image and Video Processing;2024-07-05

2. Universal Image Restoration with Text Prompt Diffusion;Sensors;2024-06-17