Affiliation:
1. School of Artificial Intelligence, Xidian University, Xi’an 710071, China
Abstract
Images obtained in an unfavorable environment may be affected by haze or fog, leading to fuzzy image details, low contrast, and loss of important information. Recently, significant progress has been achieved in the realm of image dehazing, largely due to the adoption of deep learning techniques. Owing to the lack of modules specifically designed to learn the unique characteristics of haze, existing deep neural network-based methods are impractical for processing images containing haze. In addition, most networks primarily focus on learning clear image information while disregarding potential features in hazy images. To address these limitations, we propose an innovative method called contrastive multiscale transformer for image dehazing (CMT-Net). This method uses the multiscale transformer to enable the network to learn global hazy features at multiple scales. Furthermore, we introduce feature combination attention and a haze-aware module to enhance the network’s ability to handle varying concentrations of haze by assigning more weight to regions containing haze. Finally, we design a multistage contrastive learning loss incorporating different positive and negative samples at various stages to guide the network’s learning process to restore real and non-hazy images. The experimental findings demonstrate that CMT-Net provides exceptional performance on established datasets and exhibits superior visual outcomes.
Funder
Key Research and Development Plan Projects of Shaanxi Province
Reference57 articles.
1. Sakaridis, C., Dai, D., Hecker, S., and Van Gool, L. (2018, January 8–14). Model adaptation with synthetic and real data for semantic dense foggy scene understanding. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.
2. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., and Zagoruyko, S. (2020, January 23–28). End-to-end object detection with transformers. Proceedings of the European Conference on Computer Vision, Glasgow, UK.
3. Real time image and video deweathering: The future prospects and possibilities;Kumari;Optik,2016
4. Prakash, A., Chitta, K., and Geiger, A. (2021, January 25). Multi-modal fusion transformer for end-to-end autonomous driving. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.
5. McCartney, E.J. (1976). Optics of the Atmosphere: Scattering by Molecules and Particles, John Wiley and Sons, Inc. .
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献