Affiliation:
1. Department of Control and Robot Engineering, Chungbuk National University, Cheongju 28644, Republic of Korea
Abstract
The objective of content-based image retrieval (CBIR) is to locate samples from a database that are akin to a query, relying on the content embedded within the images. A contemporary strategy involves calculating the similarity between compact vectors by encoding both the query and the database images as global descriptors. In this work, we propose an image retrieval method by using hierarchical K-means clustering to efficiently organize the image descriptors within the database, which aims to optimize the subsequent retrieval process. Then, we compute the similarity between the descriptor set within the leaf nodes and the query descriptor to rank them accordingly. Three tree search algorithms are presented to enable a trade-off between search accuracy and speed that allows for substantial gains at the expense of a slightly reduced retrieval accuracy. Our proposed method demonstrates enhancement in image retrieval speed when applied to the CLIP-based model, UNICOM, designed for category-level retrieval, as well as the CNN-based R-GeM model, tailored for particular object retrieval by validating its effectiveness across various domains and backbones. We achieve an 18-times speed improvement while preserving over 99% accuracy when applied to the In-Shop dataset, the largest dataset in the experiments.
Funder
Grand Information Technology Research Support Program
Institute of Information & Communications Technology Planning & Evaluation
Korea Institute of Science and Technology
Reference57 articles.
1. Gordoa, A., Rodriguez-Serrano, J.A., Perronnin, F., and Valveny, E. (2012, January 6–21). Leveraging category-level labels for instance-level image retrieval. Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.
2. Tolias, G., Sicre, R., and Jégou, H. (2015). Particular object retrieval with integral max-pooling of CNN activations. arXiv.
3. Fine-tuning CNN image retrieval with no human annotation;Tolias;IEEE Trans. Pattern Anal. Mach. Intell.,2018
4. Deep learning for instance retrieval: A survey;Chen;IEEE Trans. Pattern Anal. Mach. Intell.,2022
5. El-Nouby, A., Neverova, N., Laptev, I., and Jégou, H. (2021). Training vision transformers for image retrieval. arXiv.