Affiliation:
1. Amirkabir University of Technology
Abstract
Abstract
One of the most contestable problems in online learning is concept drift. In addition, if the data stream has imbalanced data, the detection of concept drift is more difficult, especially, when drift is in minority samples. Ensemble classifiers are also effective for the data stream classification with concept drift. By adjusting the weight to every individual classifier, we can manage the concept drift and misclassification problems. Using association rule mining techniques can help in balancing datasets and detecting concept drift in the early levels. In this article, we propose an Ensemble Fuzzy association Rule-based Classifier for Imbalanced data with Concept drift (EFR-IC) to deal with imbalanced streaming data containing concept drift. EFR-IC has five advantages compared with the existing methods as follows: 1) it does not need the data from previous chunks so in terms of storage space is more economical than similar methods; 2) it is stable in stationary and nonstationary environments; 3) due to the synchronization of all steps of algorithm execution -handling imbalanced data, concept drift detection, classification- execution speed is much better than similar methods; 4) it can be adapted to the new condition when swapping majority class to minority class; 5) it can timely react to multiple kinds of concept drifts. Experiments on both real and synthetic datasets containing concept drift show the effectiveness of EFR-IC in learning nonstationary imbalanced data sets.
Publisher
Research Square Platform LLC
Reference44 articles.
1. A fuzzy association rule-based classification model for high-dimensional problems with genetic rule selection and lateral tuning;Alcalá-Fdez J;IEEE Trans Fuzzy Syst,2011
2. A fuzzy association rule-based classifier for imbalanced classification problems;Sanz J;" Inform Sci,2021
3. A systematic study of online class imbalance learning with concept drift;Wang S;IEEE Trans Neural Netw Learn Syst,2018
4. Gao J, Fan W, Han J, Yu PS (2007) "A general framework for mining concept-drifting data streams with skewed distributions.," in In Proceedings of the siam international conference on data mining, 2007
5. Dynamic Weighted Majority for Incremental Learning of Imbalanced Data Streams with Concept Drift., In IJCAI;Lu Y,2017