Affiliation:
1. Microsoft Research Asia, Beijing, China
Abstract
Active learning is a machine learning technique that selects the most informative samples for labeling and uses them as training data. It has been widely explored in multimedia research community for its capability of reducing human annotation effort. In this article, we provide a survey on the efforts of leveraging active learning in multimedia annotation and retrieval. We mainly focus on two application domains: image/video annotation and content-based image retrieval. We first briefly introduce the principle of active learning and then we analyze the sample selection criteria. We categorize the existing sample selection strategies used in multimedia annotation and retrieval into five criteria:
risk reduction
,
uncertainty
,
diversity
,
density
and
relevance
. We then introduce several classification models used in active learning-based multimedia annotation and retrieval, including semi-supervised learning, multilabel learning and multiple instance learning. We also provide a discussion on several future trends in this research direction. In particular, we discuss cost analysis of human annotation and large-scale interactive multimedia annotation.
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Theoretical Computer Science
Reference86 articles.
1. Labeling images with a computer game
2. Peekaboom
3. Angluin D. 1998. Queries and concept learning. Mach. Learn. 2. 10.1023/A:1022821128753 Angluin D. 1998. Queries and concept learning. Mach. Learn. 2. 10.1023/A:1022821128753
Cited by
164 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献