Efficient Discrimination and Localization of Multimodal Remote Sensing Images Using CNN-Based Prediction of Localization Uncertainty-Reference-Cited by-同舟云学术

Efficient Discrimination and Localization of Multimodal Remote Sensing Images Using CNN-Based Prediction of Localization Uncertainty

Published:2020-02-20 Issue:4 Volume:12 Page:703
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Uss Mykhail^ORCID,Vozel Benoit^ORCID,Lukin Vladimir^ORCID,Chehdi Kacem

Abstract

Detecting similarities between image patches and measuring their mutual displacement are important parts in the registration of multimodal remote sensing (RS) images. Deep learning approaches advance the discriminative power of learned similarity measures (SM). However, their ability to find the best spatial alignment of the compared patches is often ignored. We propose to unify the patch discrimination and localization problems by assuming that the more accurately two patches can be aligned, the more similar they are. The uncertainty or confidence in the localization of a patch pair serves as a similarity measure of these patches. We train a two-channel patch matching convolutional neural network (CNN), called DLSM, to solve a regression problem with uncertainty. This CNN inputs two multimodal patches, and outputs a prediction of the translation vector between the input patches as well as the uncertainty of this prediction in the form of an error covariance matrix of the translation vector. The proposed patch matching CNN predicts a normal two-dimensional distribution of the translation vector rather than a simple value of it. The determinant of the covariance matrix is used as a measure of uncertainty in the matching of patches and also as a measure of similarity between patches. For training, we used the Siamese architecture with three towers. During training, the input of two towers is the same pair of multimodal patches but shifted by a random translation; the last tower is fed by a pair of dissimilar patches. Experiments performed on a large base of real RS images show that the proposed DLSM has both a higher discriminative power and a more precise localization compared to existing hand-crafted SMs and SMs trained with conventional losses. Unlike existing SMs, DLSM correctly predicts translation error distribution ellipse for different modalities, noise level, isotropic, and anisotropic structures.

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/12/4/703/pdf

Reference56 articles.

1. Multimodal Remote Sensing Image Registration With Accuracy Estimation at Local and Global Scales

2. Robust Point Matching via Vector Field Consensus

3. Image Registration for Remote Sensing;Le Moigne,2011

4. TS-NET: Combining Modality Specific and Common Features for Multimodal Patch Matching

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Neural Network Architectures for Assessing the Accuracy of Image Registration;2023 IEEE International Conference on Information and Telecommunication Technologies and Radio Electronics (UkrMiCo);2023-11-13

2. Multi-Resolution Feature Extraction and Fusion for Traditional Village Landscape Analysis in Remote Sensing Imagery;Traitement du Signal;2023-06-28

3. From single- to multi-modal remote sensing imagery interpretation: a survey and taxonomy;Science China Information Sciences;2023-03-27

4. Attention-Based Matching Approach for Heterogeneous Remote Sensing Images;Remote Sensing;2022-12-27

5. Improvement of Spatial Localization Accuracy in Learning-Based Patch Matching Using Anisotropic Fractal Brownian Motion Data;IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium;2022-07-17