TopicFM: Robust and Interpretable Topic-Assisted Feature Matching-Reference-Cited by-同舟云学术

TopicFM: Robust and Interpretable Topic-Assisted Feature Matching

Published:2023-06-26 Issue:2 Volume:37 Page:2447-2455
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Truong Giang Khang,Song Soohwan,Jo Sungho

Abstract

This study addresses an image-matching problem in challenging cases, such as large scene variations or textureless scenes. To gain robustness to such situations, most previous studies have attempted to encode the global contexts of a scene via graph neural networks or transformers. However, these contexts do not explicitly represent high-level contextual information, such as structural shapes or semantic instances; therefore, the encoded features are still not sufficiently discriminative in challenging scenes. We propose a novel image-matching method that applies a topic-modeling strategy to encode high-level contexts in images. The proposed method trains latent semantic instances called topics. It explicitly models an image as a multinomial distribution of topics, and then performs probabilistic feature matching. This approach improves the robustness of matching by focusing on the same semantic areas between the images. In addition, the inferred topics provide interpretability for matching the results, making our method explainable. Extensive experiments on outdoor and indoor datasets show that our method outperforms other state-of-the-art methods, particularly in challenging cases.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. CorMatcher: A corners-guided graph neural network for local feature matching;Expert Systems with Applications;2024-12

2. A fire smoke segmentation system based on dual-modal image fusion;2024 IEEE International Conference on Real-time Computing and Robotics (RCAR);2024-06-24

3. IMPRL-Net: interpretable multi-view proximity representation learning network;Neural Computing and Applications;2024-05-12

4. OD-Net: Orthogonal descriptor network for multiview image keypoint matching;Information Fusion;2024-05

5. Using scale-equivariant CNN to enhance scale robustness in feature matching;The Visual Computer;2024-04-25