Author:
Avelino Juscimara G.,Cavalcanti George D. C.,Cruz Rafael M. O.
Abstract
AbstractImbalanced problems can arise in different real-world situations, and to address this, certain strategies in the form of resampling or balancing algorithms are proposed. This issue has largely been studied in the context of classification, and yet, the same problem features in regression tasks, where target values are continuous. This work presents an extensive experimental study comprising various balancing and predictive models, and wich uses metrics to capture important elements for the user and to evaluate the predictive model in an imbalanced regression data context. It also proposes a taxonomy for imbalanced regression approaches based on three crucial criteria: regression model, learning process, and evaluation metrics. The study offers new insights into the use of such strategies, highlighting the advantages they bring to each model’s learning process, and indicating directions for further studies. The code, data and further information related to the experiments performed herein can be found on GitHub: https://github.com/JusciAvelino/imbalancedRegression.
Funder
Fundação de Amparo à Ciência e Tecnologia do Estado de Pernambuco
Conselho Nacional de Desenvolvimento Científico e Tecnológico
École de technologie supérieure
Publisher
Springer Science and Business Media LLC
Reference55 articles.
1. Agrawal A, Petersen MR (2021) Detecting arsenic contamination using satellite imagery and machine learning. Toxics 9(12):333
2. Aguiar, G., Krawczyk, B., Cano, A.: A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework. arXiv preprint arXiv:2204.03719 (2022)
3. Ali H, Salleh MNM, Hussain K, Ahmad A, Ullah A, Muhammad A, Naseem R, Khan M (2019) A review on data preprocessing methods for class imbalance problem. Int J Eng Technol 8:390–397
4. Aminian E, Ribeiro RP, Gama J (2021) Chebyshev approaches for imbalanced data streams regression models. Data Min Knowl Discov 35:2389–2466
5. Bal PR, Kumar S (2018) Cross project software defect prediction using extreme learning machine: An ensemble based study. In: ICSOFT, pp. 354–361
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献