Affiliation:
1. Department of Computer Science, Shaheed Zulfikar Ali Bhutto Institute of Science and Technology (SZABIST) , Karachi , Pakistan
2. Department of Computer Science, Millennium Institute of Technology & Entrepreneurship (MiTE) , Karachi , Pakistan
3. Department of Computing and IT (DOCIT), The Millennium Universal College (TMUC) , Islamabad 44000 , Pakistan
4. Faculty of Computer Science, Asghar Mall College Rawalpindi, HED , Govt. of Punjab , Pakistan
5. Department of Economics, Kabridahar University , Po Box 250 , Somali , Ethiopia
Abstract
Abstract
In developing countries, child health and restraining under-five child mortality are one of the fundamental concerns. UNICEF adopted sustainable development goal 3 (SDG3) to reduce the under-five child mortality rate globally to 25 deaths per 1,000 live births. The under-five mortality rate is 69 deaths per 1,000 live child-births in Pakistan as reported by the Demographic and Health Survey (2018). Predictive analytics has the power to transform the healthcare industry, personalizing care for every individual. Pakistan Demographic Health Survey (2017–2018), the publicly available dataset, is used in this study and multiple imputation methods are adopted for the treatment of missing values. The information gain, a feature selection method, ranked the information-rich features and examine their impact on child mortality prediction. The synthetic minority over-sampling method (SMOTE) balanced the training dataset, and four supervised machine learning classifiers have been used, namely the decision tree classifier, random forest classifier, naive Bayes classifier, and extreme gradient boosting classifier. For comparative analysis, accuracy, precision, recall, and F1-score have been used. Eventually, a predictive analytics framework is built that predicts whether the child is alive or dead. The number under-five children in a household, preceding birth interval, family members, mother age, age of mother at first birth, antenatal care visits, breastfeeding, child size at birth, and place of delivery were found to be critical risk factors for child mortality. The random forest classifier performed efficiently and predicted under-five child mortality with accuracy (93.8%), precision (0.964), recall (0.971), and F1-score (0.967). The findings could greatly assist child health intervention programs in decision-making.
Subject
General Agricultural and Biological Sciences,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Neuroscience
Reference30 articles.
1. Lozano R, Fullman N, Abate D, Abay SM, Abbafati C, Abbasi N, et al. Measuring progress from 1990 to 2017 and projecting attainment to 2030 of the health-related Sustainable Development Goals for 195 countries and territories: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018 Nov 10;392(10159):2091–138.
2. Bhutta ZA, Hafeez A, Rizvi A, Ali N, Khan A, Ahmad F, et al. Reproductive, maternal, newborn, and child health in Pakistan: challenges and opportunities. Lancet. 2013 Jun 22;381(9884):2207–18.
3. Patel KK, Rai R, Rai AK. Determinants of infant mortality in Pakistan: evidence from Pakistan Demographic and Health Survey 2017–18. J Public Health. 2021 Jun;29:693–701.
4. Nisar YB, Dibley MJ. Determinants of neonatal mortality in Pakistan: secondary analysis of Pakistan Demographic and Health Survey 2006–07. BMC Public Health. 2014 Dec;14:1–2.
5. Podda M, Bacciu D, Micheli A, Bellu R, Placidi G, Gagliardi L. A machine learning approach to estimating preterm infants survival: development of the Preterm Infants Survival Assessment (PISA) predictor. Sci Rep. 2018 Sep 13;8(1):13743.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献