Author:
Li Jia,Zhou Yu-qian,Zhang Qiu-yan
Abstract
IntroductionMetric learning, as a fundamental research direction in the field of computer vision, has played a crucial role in image matching. Traditional metric learning methods aim at constructing two-branch siamese neural networks to address the challenge of image matching, but they often overlook to cross-source and cross-view scenarios.MethodsIn this article, a multi-branch metric learning model is proposed to address these limitations. The main contributions of this work are as follows: Firstly, we design a multi-branch siamese network model that enhances measurement reliability through information compensation among data points. Secondly, we construct a non-local information perception and fusion model, which accurately distinguishes positive and negative samples by fusing information at different scales. Thirdly, we enhance the model by integrating semantic information and establish an information consistency mapping between multiple branches, thereby improving the robustness in cross-source and cross-view scenarios.ResultsExperimental tests which demonstrate the effectiveness of the proposed method are carried out under various conditions, including homologous, heterogeneous, multi-view, and crossview scenarios. Compared to the state-of-the-art comparison algorithms, our proposed algorithm achieves an improvement of ~1, 2, 1, and 1% in terms of similarity measurement Recall@10, respectively, under these four conditions.DiscussionIn addition, our work provides an idea for improving the crossscene application ability of UAV positioning and navigation algorithm.
Subject
Artificial Intelligence,Biomedical Engineering
Reference58 articles.
1. “Elasticface: elastic margin loss for deep face recognition,”;Boutros;Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2022
2. Siamese neural networks: an overview;Chicco;Artif. Neural Netw,2021
3. Features for image retrieval: an experimental comparison;Deselaers;Inf. Retriev. J,2008
4. The pascal visual object classes (voc) challenge;Everingham;Int. J. Comp. Vis,2010
5. “Clothes-changing person re-identification with rgb modality only,”;Gu;Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition