EFR-IC: Ensemble Fuzzy association Rule-based classifier for Imbalanced data streams with Concept drift

Author:

Roshanfekr Saeideh1,Razzazi Mohammad Reza1

Affiliation:

1. Amirkabir University of Technology

Abstract

Abstract One of the most contestable problems in online learning is concept drift. In addition, if the data stream has imbalanced data, the detection of concept drift is more difficult, especially, when drift is in minority samples. Ensemble classifiers are also effective for the data stream classification with concept drift. By adjusting the weight to every individual classifier, we can manage the concept drift and misclassification problems. Using association rule mining techniques can help in balancing datasets and detecting concept drift in the early levels. In this article, we propose an Ensemble Fuzzy association Rule-based Classifier for Imbalanced data with Concept drift (EFR-IC) to deal with imbalanced streaming data containing concept drift. EFR-IC has five advantages compared with the existing methods as follows: 1) it does not need the data from previous chunks so in terms of storage space is more economical than similar methods; 2) it is stable in stationary and nonstationary environments; 3) due to the synchronization of all steps of algorithm execution -handling imbalanced data, concept drift detection, classification- execution speed is much better than similar methods; 4) it can be adapted to the new condition when swapping majority class to minority class; 5) it can timely react to multiple kinds of concept drifts. Experiments on both real and synthetic datasets containing concept drift show the effectiveness of EFR-IC in learning nonstationary imbalanced data sets.

Publisher

Research Square Platform LLC

Reference44 articles.

1. A fuzzy association rule-based classification model for high-dimensional problems with genetic rule selection and lateral tuning;Alcalá-Fdez J;IEEE Trans Fuzzy Syst,2011

2. A fuzzy association rule-based classifier for imbalanced classification problems;Sanz J;" Inform Sci,2021

3. A systematic study of online class imbalance learning with concept drift;Wang S;IEEE Trans Neural Netw Learn Syst,2018

4. Gao J, Fan W, Han J, Yu PS (2007) "A general framework for mining concept-drifting data streams with skewed distributions.," in In Proceedings of the siam international conference on data mining, 2007

5. Dynamic Weighted Majority for Incremental Learning of Imbalanced Data Streams with Concept Drift., In IJCAI;Lu Y,2017

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3