Abstract
Near-duplicate image detection, in both single-modality and cross-modality settings, has recently received wide attention in the pattern recognition and computer vision communities. Existing methods based on deep neural networks have achieved remarkable performance on this task. However, most of them focus on learning the features of each image in a pair individually and therefore make limited use of the information shared between the two near-duplicate images. In this paper, to better exploit the correlations between image pairs, we propose a spatial transformer comparing convolutional neural network (CNN) model for comparing near-duplicate image pairs. Specifically, we first propose a comparing CNN framework equipped with a cross-stream that learns the correlation information between the two images while still considering the features of each image. Furthermore, to deal with the local deformations caused by cropping, translation, scaling, and non-rigid transformations, we introduce a spatial transformer comparing CNN model by incorporating a spatial transformer module into the comparing CNN architecture. To demonstrate the effectiveness of the proposed method on both single-modality and cross-modality (optical-infrared) near-duplicate image pair detection, we conduct extensive experiments on three popular benchmark datasets: CaliforniaND (ND stands for near duplicate), Mir-Flickr Near Duplicate, and TNO Multi-band Image Data Collection. The experimental results show that the proposed method achieves superior performance compared with many state-of-the-art methods on both tasks.
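To illustrate the kind of operation a spatial transformer module performs, the following is a minimal sketch of its resampling step: an affine grid followed by bilinear sampling. It follows the general spatial transformer formulation and is not the configuration used in the paper. The function names, the normalized [-1, 1] coordinate convention, and the zero padding outside the image are assumptions made for illustration. In a full module, the 2x3 matrix `theta` would be predicted by a small localization network, whereas here it is set by hand.

```python
import math


def affine_grid(theta, height, width):
    """For each output pixel, compute the source coordinate (x, y) in
    normalized [-1, 1] space under the 2x3 affine matrix theta."""
    grid = []
    for i in range(height):
        y = (-1.0 + 2.0 * i / (height - 1)) if height > 1 else 0.0
        row = []
        for j in range(width):
            x = (-1.0 + 2.0 * j / (width - 1)) if width > 1 else 0.0
            xs = theta[0][0] * x + theta[0][1] * y + theta[0][2]
            ys = theta[1][0] * x + theta[1][1] * y + theta[1][2]
            row.append((xs, ys))
        grid.append(row)
    return grid


def bilinear_sample(image, grid):
    """Sample a 2D image (list of rows) at the grid coordinates using
    bilinear interpolation; samples outside the image contribute zero."""
    h, w = len(image), len(image[0])
    out = []
    for row in grid:
        out_row = []
        for xs, ys in row:
            # Convert normalized coordinates back to pixel coordinates.
            px = (xs + 1.0) * (w - 1) / 2.0
            py = (ys + 1.0) * (h - 1) / 2.0
            x0, y0 = math.floor(px), math.floor(py)
            wx, wy = px - x0, py - y0
            value = 0.0
            # Weighted sum over the four neighbouring pixels.
            for yy, wyy in ((y0, 1.0 - wy), (y0 + 1, wy)):
                for xx, wxx in ((x0, 1.0 - wx), (x0 + 1, wx)):
                    if 0 <= yy < h and 0 <= xx < w:
                        value += wyy * wxx * image[yy][xx]
            out_row.append(value)
        out.append(out_row)
    return out


def spatial_transform(image, theta):
    """Warp an image with a hand-specified affine matrix theta."""
    grid = affine_grid(theta, len(image), len(image[0]))
    return bilinear_sample(image, grid)


if __name__ == "__main__":
    image = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
    identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
    shift = [[1.0, 0.0, 1.0], [0.0, 1.0, 0.0]]  # sample one pixel to the right
    print(spatial_transform(image, identity))
    print(spatial_transform(image, shift))
```

Because both steps are differentiable with respect to `theta`, a module of this kind can be trained end to end with the rest of a network, which is what allows it to compensate for cropping, translation, and scaling between the two images of a pair.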
Funder
National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry
Cited by
16 articles.
1. Automated Near-Duplicate Image Detection using Sparrow Search Algorithm with Deep Learning Model;2024 International Conference on Cognitive Robotics and Intelligent Systems (ICC - ROBINS);2024-04-17
2. Cross-ViT: Cross-attention Vision Transformer for Image Duplicate Detection;2023 8th International Conference on Information Technology Research (ICITR);2023-12-07
3. Modelling of Firefly Algorithm with Densely Connected Networks for Near-Duplicate Image Detection System;2023 International Conference on Sustainable Communication Networks and Application (ICSCNA);2023-11-15
4. Vista Morph - Unsupervised Image Registration of Visible-Thermal Facial Pairs;2023 IEEE International Joint Conference on Biometrics (IJCB);2023-09-25
5. flowSim: Near duplicate detection for flow cytometry data;Cytometry Part A;2023-08-29