Affiliation:
1. School of Computer Science, Civil Aviation Flight University of China, Guanghan 618307, China
Abstract
Homography estimation for infrared and visible images is a critical and fundamental task in multimodal image processing. Recently, the coarse-to-fine strategy has been gradually applied to the homography estimation task and has proved to be effective. However, current coarse-to-fine homography estimation methods typically require the introduction of additional neural networks to acquire multi-scale feature maps and the design of complex homography matrix fusion strategies. In this paper, we propose a new unsupervised homography estimation method for infrared and visible images. First, we design a novel coarse-to-fine strategy. This strategy utilizes different stages in the regression network to obtain multi-scale feature maps, enabling the progressive refinement of the homography matrix. Second, we design a local correlation transformer (LCTrans), which aims to capture the intrinsic connections between local features more precisely, thus highlighting the features crucial for homography estimation. Finally, we design an average feature correlation loss (AFCL) to enhance the robustness of the model. Through extensive experiments, we validated the effectiveness of all the proposed components. Experimental results demonstrate that our method outperforms existing methods on synthetic benchmark datasets in both qualitative and quantitative comparisons.
Funder
National Key R&D Program of China
Science and Technology Plan Project of Sichuan Province
Fundamental Research Funds for the Central Universities
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference49 articles.
1. A view-free image stitching network based on global homography;Nie;J. Vis. Commun. Image Represent.,2020
2. Deep Image Registration with Depth-Aware Homography Estimation;Huang;IEEE Signal Process. Lett.,2023
3. Reinforcement learning-based image exposure reconstruction for homography estimation;Lin;Appl. Intell.,2023
4. Son, D.-M., Kwon, H.-J., and Lee, S.-H. (2022). Visible and Near Infrared Image Fusion Using Base Tone Compression and Detail Transform Fusion. Chemosensors, 10.
5. YOLACTFusion: An instance segmentation method for RGB-NIR multimodal image fusion based on an attention mechanism;Liu;Comput. Electron. Agric.,2023
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献