Author:
Kalayou Mulugeta Hayelom,Kassaw Abdul-Aziz Kebede,Shiferaw Kirubel Biruk
Abstract
Abstract
Background
A dearth of studies showed that infectious diseases cause the majority of deaths among under-five children. Worldwide, Acute Respiratory Infection (ARI) continues to be the second most frequent cause of illness and mortality among children under the age of five. The paramount disease burden in developing nations, including Ethiopia, is still ARI.
Objective
This study aims to determine the magnitude and predictors of ARI among under-five children in Ethiopia using used state of the art machine learning algorithms.
Methods
Data for this study were derived from the 2016 Ethiopian Demographic and Health Survey. To predict the determinants of acute respiratory infections, we performed several experiments on ten machine learning algorithms (random forests, decision trees, support vector machines, Naïve Bayes, and K-nearest neighbors, Lasso regression, GBoost, XGboost), including one classic logistic regression model and an ensemble of the best performing models. The prediction ability of each machine-learning model was assessed using receiver operating characteristic curves, precision-recall curves, and classification metrics.
Results
The total ARI prevalence rate among 9501 under-five children in Ethiopia was 7.2%, according to the findings of the study. The overall performance of the ensemble model of SVM, GBoost, and XGBoost showed an improved performance in classifying ARI cases with an accuracy of 86%, a sensitivity of 84.6%, and an AUC-ROC of 0.87. The highest performing predictive model (the ensemble model) showed that the child’s age, history of diarrhea, wealth index, type of toilet, mother’s educational level, number of living children, mother’s occupation, and type of fuel they used were an important predicting factor for acute respiratory infection among under-five children.
Conclusion
The intricate web of factors contributing to ARI among under-five children was identified using an advanced machine learning algorithm. The child’s age, history of diarrhea, wealth index, and type of toilet were among the top factors identified using the ensemble model that registered a performance of 86% accuracy. This study stands as a testament to the potential of advanced data-driven methodologies in unraveling the complexities of ARI in low-income settings.
Publisher
Springer Science and Business Media LLC