Abstract
Conditional image retrieval (CIR), which involves retrieving images by a query image along with user-specified conditions, is essential in computer vision research for efficient image search and automated image analysis. The existing approaches, such as composed image retrieval (CoIR) methods, have been actively studied. However, these methods face challenges as they require either a triplet dataset or richly annotated image-text pairs, which are expensive to obtain. In this work, we demonstrate that CIR at the image-level concept can be achieved using an inverse mapping approach that explores the model’s inductive knowledge. Our proposed CIR method, called Backward Search, updates the query embedding to conform to the condition. Specifically, the embedding of the query image is updated by predicting the probability of the label and minimizing the difference from the condition label. This enables CIR with image-level concepts while preserving the context of the query. In this paper, we introduce the Backward Search method that enables single and multi-conditional image retrieval. Moreover, we efficiently reduce the computation time by distilling the knowledge. We conduct experiments using the WikiArt, aPY, and CUB benchmark datasets. The proposed method achieves an average mAP@10 of 0.541 on the datasets, demonstrating a marked improvement compared to the CoIR methods in our comparative experiments. Furthermore, by employing knowledge distillation with the Backward Search model as the teacher, the student model achieves a significant reduction in computation time, up to 160 times faster with only a slight decrease in performance. The implementation of our method is available at the following URL: https://github.com/dhlee-work/BackwardSearch.
Publisher
Public Library of Science (PLoS)
Reference52 articles.
1. Content-based image retrieval at the end of the early years;AWM Smeulders;IEEE Transactions on Pattern Analysis and Machine Intelligence,2000
2. A Decade Survey of Content Based Image Retrieval using Deep Learning;SR Dubey;IEEE Transactions on Circuits and Systems for Video Technology,2021
3. Content-based multimedia information retrieval: State of the art and challenges;M Lew;ACM Transactions on Multimedia Computing, Communications, and Applications,2006
4. Performance evaluation in content-based image retrieval: overview and proposals;H Müller;Pattern Recognition Letters,2001
5. A review on automatic image annotation techniques;D Zhang;Pattern Recognition,2012