Affiliation:
1. School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China
Abstract
There are significant background changes and complex spatial correspondences between multi-modal remote sensing images, and it is difficult for existing methods to extract common features between images effectively, leading to poor matching results. In order to improve the matching effect, features with high robustness are extracted; this paper proposes a multi-temporal remote sensing matching algorithm CMRM (CNN multi-modal remote sensing matching) based on deformable convolution and cross-attention. First, based on the VGG16 backbone network, Deformable VGG16 (DeVgg) is constructed by introducing deformable convolutions to adapt to significant geometric distortions in remote sensing images of different shapes and scales; second, the features extracted from DeVgg are input to the cross-attention module to better capture the spatial correspondence of images with background changes; and finally, the key points and corresponding descriptors are extracted from the output feature map. In the feature matching stage, in order to solve the problem of poor matching quality of feature points, BFMatcher is used for rough registration, and then the RANSAC algorithm with adaptive threshold is used for constraint. The proposed algorithm in this paper performs well on the public dataset HPatches, with MMA values of 0.672, 0.710, and 0.785 when the threshold is selected as 3–5. The results show that compared to existing methods, our method improves the matching accuracy of multi-modal remote sensing images.
Funder
Sichuan Natural Science Foundation
Sichuan Science and Technology Program
Science and Technology Major Project of Sichuan
Project of Innovation Ability Enhancement of Chengdu University of Information Technology
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference44 articles.
1. Image registration methods: A survey;Flusser;Image Vis. Comput.,2003
2. Remote sensing image registration methodology: Review and discussion;Tondewad;Procedia Comput. Sci.,2020
3. Muja, M. (2009). FLANN-Fast Library for Approximate Nearest Neighbors User Manual, Computer Science Department, University of British Columbia. Available online: https://www.fit.vutbr.cz/~ibarina/pub/VGE/reading/flann_manual-1.6.pdf.
4. Distinctive image features from scale-invariant keypoints;Lowe;Int. J. Comput. Vis.,2004
5. Kang, W., and Liu, Y. (2014). Contact-free palm-vein recognition based on local invariant features. PLoS ONE, 9.