An Ensemble Models for the Prediction of Sickle Cell Disease from Erythrocytes Smears
-
Published:2023-09-19
Issue:
Volume:9
Page:
-
ISSN:2411-7145
-
Container-title:EAI Endorsed Transactions on Pervasive Health and Technology
-
language:
-
Short-container-title:EAI Endorsed Trans Perv Health Tech
Author:
Ayoade Oluwafisayo BabatopeORCID, Oladele Tinuke Omolewa, Lucky Imoize Agbotiname, Adeloye Jerome Adetoye, Awotunde Joseph Bambidele, Olorunyomi Segun Omotayo, Faboya Oulsola Theophilius, Idowu Ayorinde Oladele
Abstract
INTRODUCTION: The human blood as a collection of tissues containing Red Blood Cells (RBCs), circular in shape and acting as an oxygen carrier, are frequently deformed by multiple blood diseases inherited from parents. These hereditary diseases of blood involve abnormal haemoglobin (Hb) or anemia which are major public health issues. Sickle Cell Disease (SCD) is one of the common non-communicable disease and genetic disorder due to changes in hematological conditions of the RBCs which often causes the inheritance of mutant Hb genes by the patient..
OBJECTIVES: The process of manual valuation, predictions and diagnosis of SCD necessitate for a passionate time spending and if not done properly can lead to wrong predictions and diagnosis. Machine Learning (ML), a branch of AI which emphases on building systems that improve performance based on the data they consume is appropriate. Despite previous research efforts in predicting with single ML algorithm, the existing systems still suffer from high false and wrong predictions.
METHODS: Thus, this paper aimed at performing comparative analysis of individual ML algorithms and their ensemble models for effective predictions of SCD (elongated shapes) in erythrocytes blood cells. Three ML algorithms were selected, and ensemble models were developed to perform the predictions and metrics were used to evaluate the performance of the model using accuracy, sensitivity, Receiver Operating Characteristics-Area under Curve (ROC-AUC) and F1 score metrics. The results were compared with existing literature for model(s) with the best prediction metrics performance..
RESULTS: The analysis was carried out using Python programming language. Individual ML algorithms reveals that their accuracies show MLR=87%, XGBoost=90%, and RF=93%, while hybridized RF-MLR=92% and RF-XGBoost=99%. The accuracy of RF-XGBoost of 99% outperformed other individual ML algorithms and Hybrid models.
CONCLUSION: Thus, the study concluded that involving hybridized ML algorithms in medical datasets increased predictions performance as it removed the challenges of high variance, low accuracy and feature noise and biases of medical datasets. The paper concluded that ensemble classifiers should be considered to improve sickle cell disease predictions.
Publisher
European Alliance for Innovation n.o.
Subject
Health Informatics,Computer Science (miscellaneous)
Reference58 articles.
1. L. Alzubaidi, M. A. Fadhel, O. Al-shamma, and J. Zhang, “Deep Learning Models for Classification of Red Blood Cells in Microscopy Images to Aid in Sickle Cell Anemia Diagnosis,” Electron. MDPI, vol. 9, no. 427, pp. 1–18, 2020. 2. P. K. Das, S. Meher, R. Panda, and A. Abraham, “A Review of Automated Methods for the Detection of Sickle Cell Disease,” IEEE Rev. Biomed. Eng., vol. 13, pp. 309–324, 2020, doi: 10.1109/RBME.2019.2917780. 3. P. L. Stephenson, M. V. Taylor, and C. Anglin, “Sickle Cell Disease,” J. Consum. Health Internet, vol. 19, no. 2, pp. 122–131, 2015, doi: 10.1080/15398285.2015.1026706. 4. M. W. Darlison and B. Modell, “Sickle-cell disorders: limits of descriptive epidemiology.,” Lancet (London, England), vol. 381, no. 9861, pp. 98–9, Jan. 2013, doi: 10.1016/S0140-6736(12)61817-0. 5. J. Kanter and R. Kruse-Jarres, “Management of sickle cell disease from childhood through adulthood.,” Blood Rev., vol. 27, no. 6, pp. 279–87, Nov. 2013, doi: 10.1016/j.blre.2013.09.001.
|
|