Author:
Kou Ai-Jun,Huang Xu,Sun Wen-Xue
Abstract
AbstractConcept drift (CD) in data streams can significantly impact the performance and stability of data stream classification algorithms, diminishing the generalization capabilities of integrated learning models. This paper addresses CD issues in dichotomous data streams by introducing a novel modeling approach that leverages evolutionary computation techniques. The method entails grouping base learners based on their performance within a sliding window and then evolving the base learning periods using evolutionary techniques. Furthermore, the concept of “gene flow” is introduced to enhance diversity among base learners and improve CD prediction performance. Experimental results on real and artificial datasets demonstrate the superior comprehensive performance of the proposed method. Specifically, the BCDECA algorithm outperforms other similar methods, excelling in accuracy, diversity, convergence rate, and robustness on a range of datasets.
Publisher
Springer Science and Business Media LLC
Reference34 articles.
1. Lu J, Liu A, Dong F, et al. Learning under concept drift: a review. IEEE Trans Knowl Data Eng. 2018;31(12):2346–63.
2. Gama J, Žliobaitė I, Bifet A, et al. A survey on concept drift adaptation. ACM Comput Surv. 2014;46(4):1–37.
3. Žliobaitė I, Pechenizkiy M, Gama J. An overview of concept drift applications. Big data analysis: new algorithms for a new society, 2016: 91-114.
4. Dries A, Rückert U. Adaptive concept drift detection. Stat Anal Data Mining. 2009;2(5–6):311–27.
5. Tsymbal A, Pechenizkiy M, Cunningham P, et al. Dynamic integration of classifiers for handling concept drift. Information Fusion. 2008;9(1):56–68.