Abstract
Background
Preterm birth (PTB), a common pregnancy complication, is responsible for 35% of the 3.1 million pregnancy-related deaths each year and significantly affects around 15 million children annually worldwide. Conventional approaches to predict PTB lack reliable predictive power, leaving >50% of cases undetected. Recently, machine learning (ML) models have shown potential as an appropriate complementary approach for PTB prediction using health records (HRs).
Objective
This study aimed to systematically review the literature concerned with PTB prediction using HR data and the ML approach.
Methods
This systematic review was conducted in accordance with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) statement. A comprehensive search was performed in 7 bibliographic databases until May 15, 2021. The quality of the studies was assessed, and descriptive information, including descriptive characteristics of the data, ML modeling processes, and model performance, was extracted and reported.
Results
A total of 732 papers were screened through title and abstract. Of these 732 studies, 23 (3.1%) were screened by full text, resulting in 13 (1.8%) papers that met the inclusion criteria. The sample size varied from a minimum value of 274 to a maximum of 1,400,000. The time length for which data were extracted varied from 1 to 11 years, and the oldest and newest data were related to 1988 and 2018, respectively. Population, data set, and ML models’ characteristics were assessed, and the performance of the model was often reported based on metrics such as accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve.
Conclusions
Various ML models used for different HR data indicated potential for PTB prediction. However, evaluation metrics, software and package used, data size and type, selected features, and importantly data management method often remain unjustified, threatening the reliability, performance, and internal or external validity of the model. To understand the usefulness of ML in covering the existing gap, future studies are also suggested to compare it with a conventional method on the same data set.
Subject
Health Information Management,Health Informatics
Reference32 articles.
1. Global burden of preterm birth
2. Preterm birthWorld Health Organization20182022-01-21https://www.who.int/news-room/fact-sheets/detail/preterm-birth
3. TranTLuoWPhungDMorrisJRickardKVenkateshSPreterm birth prediction: deriving stable and interpretable rules from high dimensional dataProceedings of the 1st Machine Learning for Healthcare Conference2016PMLR '16August 19-20, 2016Los Angeles, CA, USA16477
4. Good clinical practice advice: Prediction of preterm labor and preterm premature rupture of membranes
5. Estimating Recurrence of Spontaneous Preterm Delivery