A survey on semi-supervised learning

Published: 2019-11-15, Issue: 2, Volume: 109, Pages: 373-440
ISSN: 0885-6125
Container-title: Machine Learning
Language: en
Short-container-title: Mach Learn

Author:

van Engelen Jesper E., Hoos Holger H.

Abstract

Semi-supervised learning is the branch of machine learning concerned with using labelled as well as unlabelled data to perform certain learning tasks. Conceptually situated between supervised and unsupervised learning, it permits harnessing the large amounts of unlabelled data available in many use cases in combination with typically smaller sets of labelled data. In recent years, research in this area has followed the general trends observed in machine learning, with much attention directed at neural network-based models and generative learning. The literature on the topic has also expanded in volume and scope, now encompassing a broad spectrum of theory, algorithms and applications. However, no recent surveys exist to collect and organize this knowledge, impeding the ability of researchers and engineers alike to utilize it. Filling this void, we present an up-to-date overview of semi-supervised learning methods, covering earlier work as well as more recent advances. We focus primarily on semi-supervised classification, where the large majority of semi-supervised learning research takes place. Our survey aims to provide researchers and practitioners new to the field as well as more advanced readers with a solid understanding of the main approaches and algorithms developed over the past two decades, with an emphasis on the most prominent and currently relevant work. Furthermore, we propose a new taxonomy of semi-supervised classification algorithms, which sheds light on the different conceptual and methodological approaches for incorporating unlabelled data into the training process. Lastly, we show how the fundamental assumptions underlying most semi-supervised learning algorithms are closely connected to each other, and how they relate to the well-known semi-supervised clustering assumption.

Funder

Leiden University

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Software

Link

http://link.springer.com/content/pdf/10.1007/s10994-019-05855-6.pdf

References: 215 articles.

1. Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., & Isard, M., et al. (2016). Tensorflow: A system for large-scale machine learning. In 12th USENIX symposium on operating systems design and implementation (OSDI 16) (pp. 265–283).

2. Abney, S. (2002). Bootstrapping. In Proceedings of the 40th annual meeting on association for computational linguistics, association for computational linguistics (pp. 360–367).

3. Anderberg, M. R. (1973). Cluster analysis for applications. Cambridge: Academic Press.

4. Azran, A. (2007). The rendezvous algorithm: Multiclass semi-supervised learning with Markov random walks. In Proceedings of the 24th international conference on machine learning (pp. 49–56).

5. Bachman, P., Alsharif, O., & Precup, D. (2014). Learning with pseudo-ensembles. In Advances in neural information processing systems (pp. 3365–3373).

Cited by 1483 articles.

1. Anomaly and intrusion detection using deep learning for software-defined networks: A survey;Expert Systems with Applications;2024-12

2. Semi-supervised clustering guided by pairwise constraints and local density structures;Pattern Recognition;2024-12

3. Graph-based semi-supervised learning with non-convex graph total variation regularization;Expert Systems with Applications;2024-12

4. Leveraging a self-adaptive mean teacher model for semi-supervised multi-exposure image fusion;Information Fusion;2024-12

5. Artificial intelligence-driven real-world battery diagnostics;Energy and AI;2024-12