Affiliation:
1. Center for Applied Data Science Gütersloh (CfADS), FH Bielefeld-University of Applied Sciences, 33619 Bielefeld, Germany
Abstract
Despite the availability and ease of collecting a large amount of free, unlabeled data, the expensive and time-consuming labeling process is still an obstacle to labeling a sufficient amount of training data, which is essential for building supervised learning models. Here, with low labeling cost, the active learning (AL) technique could be a solution, whereby a few, high-quality data points are queried by searching for the most informative and representative points within the instance space. This strategy ensures high generalizability across the space and improves classification performance on data we have never seen before. In this paper, we provide a survey of recent studies on active learning in the context of classification. This survey starts with an introduction to the theoretical background of the AL technique, AL scenarios, AL components supported with visual explanations, and illustrative examples to explain how AL simply works and the benefits of using AL. In addition to an overview of the query strategies for the classification scenarios, this survey provides a high-level summary to explain various practical challenges with AL in real-world settings; it also explains how AL can be combined with various research areas. Finally, the most commonly used AL software packages and experimental evaluation metrics with AL are also discussed.
Funder
SustAInable Lifecycle of Intelligent Socio-Technical Systems
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference220 articles.
1. Mitchell, T. (1997). Machine Learning, McGraw-Hill.
2. Generalizing from a few examples: A survey on few-shot learning;Wang;ACM Comput. Surv. (CSUR),2020
3. Settles, B. (2009). Active Learning Literature Survey, Department of Computer Sciences, University of Wisconsin-Madison. Computer Sciences Technical Report.
4. Improving generalization with active learning;Cohn;Mach. Learn.,1994
5. Active learning through label error statistical methods;Wang;Knowl.-Based Syst.,2020
Cited by
36 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献