Abstract
We present a modelling framework for the investigation of supervised learning in non-stationary environments. Specifically, we model two example types of learning systems: prototype-based learning vector quantization (LVQ) for classification and shallow, layered neural networks for regression. We investigate student–teacher scenarios in which the systems are trained from a stream of high-dimensional, labeled data. Properties of the target task are non-stationary due to drift processes that take place while the training is performed. We study different types of concept drift, which affect the density of the example inputs only, the target rule itself, or both. Applying methods from statistical physics, we develop a framework for the mathematical analysis of the training dynamics in non-stationary environments. Our results show that standard LVQ algorithms are, to a certain extent, already suitable for training in non-stationary environments. However, weight decay as an explicit mechanism of forgetting does not improve performance under the drift processes considered. Furthermore, we investigate gradient-based training of layered neural networks with sigmoidal activation functions and compare it with the use of rectified linear units (ReLU). Our findings show that the sensitivity to concept drift and the effectiveness of weight decay differ significantly between the two types of activation function.
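To make the setting concrete, the following is a minimal sketch (not the authors' code) of online LVQ1 training with optional weight decay, fed by a stream of high-dimensional examples whose class-conditional cluster centers drift over time. All parameter values and the drift mechanism are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

N = 100        # input dimension (the theory considers the limit N -> infinity)
eta = 0.5      # learning rate
gamma = 1e-3   # weight-decay strength (explicit forgetting); 0 disables it
drift = 1e-3   # per-step displacement of the class centers (concept drift)

# Two drifting class-conditional Gaussian clusters (the "teacher" densities)
centers = rng.normal(size=(2, N))
centers /= np.linalg.norm(centers, axis=1, keepdims=True)

# One prototype per class (the "student")
w = rng.normal(scale=1e-2, size=(2, N))

for t in range(20000):
    y = rng.integers(2)                        # class label
    x = centers[y] + rng.normal(size=N)        # noisy example from cluster y

    # LVQ1: move the winning prototype toward the example if the labels
    # match, away from it otherwise
    j = np.argmin(((w - x) ** 2).sum(axis=1))  # winner-takes-all
    sign = 1.0 if j == y else -1.0
    w[j] += (eta / N) * sign * (x - w[j])

    # Explicit forgetting: multiplicative weight decay on all prototypes
    w *= 1.0 - gamma / N

    # Concept drift: the cluster centers perform a random displacement
    centers += drift * rng.normal(size=centers.shape) / np.sqrt(N)
```

Along the same lines, a hedged sketch of the regression scenario: online gradient descent for a shallow network of the soft-committee type, where the hidden activation can be switched between a sigmoidal function (erf) and ReLU to compare the two cases discussed in the abstract. Again, names and values are assumptions for illustration only.

```python
import numpy as np
from scipy.special import erf

rng = np.random.default_rng(1)
N, K = 100, 2            # input dimension, number of hidden units
eta, gamma = 0.5, 1e-3   # learning rate, weight-decay strength

def g(a, kind):
    """Hidden activation and its derivative, elementwise."""
    if kind == "erf":
        return erf(a / np.sqrt(2)), np.sqrt(2 / np.pi) * np.exp(-a**2 / 2)
    return np.maximum(a, 0.0), (a > 0).astype(float)  # ReLU

B = rng.normal(size=(K, N)) / np.sqrt(N)  # teacher weights (slowly drifting)
W = rng.normal(size=(K, N)) / np.sqrt(N)  # student weights

for t in range(20000):
    x = rng.normal(size=N)
    tau = g(B @ x, "erf")[0].sum()          # teacher output (target)
    h, dh = g(W @ x, "erf")                 # switch to "relu" to compare
    err = h.sum() - tau                     # regression error
    W -= (eta / N) * err * dh[:, None] * x  # stochastic gradient step
    W *= 1.0 - gamma / N                    # weight decay (forgetting)
    B += 1e-3 * rng.normal(size=B.shape) / N  # real drift of the target rule
```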
Funder
Bundesministerium für Bildung und Forschung
Northern Netherlands Region of Smart Factories (RoSF)
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence, Software