Affiliation:
1. Nanyang Technological University, Singapore
2. Columbia University, New York, NY
Abstract
Learning a good distance metric plays a vital role in many multimedia retrieval and data mining tasks. For example, a typical content-based image retrieval (CBIR) system often relies on an effective distance metric to measure similarity between any two images. Conventional CBIR systems simply adopting Euclidean distance metric often fail to return satisfactory results mainly due to the well-known semantic gap challenge. In this article, we present a novel framework of
Semi-Supervised Distance Metric Learning
for learning effective distance metrics by exploring the historical relevance feedback log data of a CBIR system and utilizing unlabeled data when log data are limited and noisy. We formally formulate the learning problem into a convex optimization task and then present a new technique, named as “Laplacian Regularized Metric Learning” (LRML). Two efficient algorithms are then proposed to solve the LRML task. Further, we apply the proposed technique to two applications. One direct application is for Collaborative Image Retrieval (CIR), which aims to explore the CBIR log data for improving the retrieval performance of CBIR systems. The other application is for Collaborative Image Clustering (CIC), which aims to explore the CBIR log data for enhancing the clustering performance of image pattern clustering tasks. We conduct extensive evaluation to compare the proposed LRML method with a number of competing methods, including 2 standard metrics, 3 unsupervised metrics, and 4 supervised metrics with side information. Encouraging results validate the effectiveness of the proposed technique.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture
Reference41 articles.
1. Cover trees for nearest neighbor
2. Boyd S. and Vandenberghe L. 2003. Convex Optimization. Cambridge University Press. Boyd S. and Vandenberghe L. 2003. Convex Optimization. Cambridge University Press.
3. Cox T. and Cox M. 1994. Multidimensional Scaling. Chapman & Hall London. Cox T. and Cox M. 1994. Multidimensional Scaling. Chapman & Hall London.
4. Dom B. E. 2001. An information-theoretic external cluster-validity measure. Res. rep. RJ 10219 IBM. Dom B. E. 2001. An information-theoretic external cluster-validity measure. Res. rep. RJ 10219 IBM.
Cited by
107 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献