Machine Learning Models for Predicting Adverse Pregnancy Outcomes in Pregnant Women with Systemic Lupus Erythematosus-Reference-Cited by-同舟云学术

Machine Learning Models for Predicting Adverse Pregnancy Outcomes in Pregnant Women with Systemic Lupus Erythematosus

Published:2023-02-07 Issue:4 Volume:13 Page:612
ISSN:2075-4418
Container-title:Diagnostics
language:en
Short-container-title:Diagnostics

Author:

Hao Xinyu¹,Zheng Dongying²³⁴^ORCID,Khan Muhanmmad⁵^ORCID,Wang Lixia³,Hämäläinen Timo⁴^ORCID,Cong Fengyu¹⁴⁶⁷,Xu Hongming¹⁷,Song Kedong²^ORCID

Affiliation:

1. School of Biomedical Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China

2. State Key Laboratory of Fine Chemicals, Dalian R&D Center for Stem Cell and Tissue Engineering, Dalian University of Technology, Dalian 116024, China

3. Department of Obstetrics and Gynecology, Second Affiliated Hospital of Dalian Medical University, Dalian 116027, China

4. Faculty of Information Technology, University of Jyvaskyla, 40014 Jyvaskyla, Finland

5. Institute of Zoology, University of the Punjab, Quaid-e-Azam Campus, Lahore 54590, Pakistan

6. School of Artificial Intelligence, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China

7. Key Laboratory of Integrated Circuit and Biomedical Electronic System, Liaoning Province, Dalian University of Technology, Dalian 116024, China

Abstract

Predicting adverse outcomes is essential for pregnant women with systemic lupus erythematosus (SLE) to minimize risks. Applying statistical analysis may be limited for the small sample size of childbearing patients, while the informative medical records could be provided. This study aimed to develop predictive models applying machine learning (ML) techniques to explore more information. We performed a retrospective analysis of 51 pregnant women exhibiting SLE, including 288 variables. After correlation analysis and feature selection, six ML models were applied to the filtered dataset. The efficiency of these overall models was evaluated by the Receiver Operating Characteristic Curve. Meanwhile, real-time models with different timespans based on gestation were also explored. Eighteen variables demonstrated statistical differences between the two groups; more than forty variables were screened out by ML variable selection strategies as contributing predictors, while the overlap of variables were the influential indicators testified by the two selection strategies. The Random Forest (RF) algorithm demonstrated the best discrimination ability under the current dataset for overall predictive models regardless of the data missing rate, while Multi-Layer Perceptron models ranked second. Meanwhile, RF achieved best performance when assessing the real-time predictive accuracy of models. ML models could compensate the limitation of statistical methods when the small sample size problem happens along with numerous variables acquired, while RF classifier performed relatively best when applied to such structured medical records.

Funder

Fundamental Research Funds for the Central Universities

Publisher

MDPI AG

Subject

Clinical Biochemistry

Link

https://www.mdpi.com/2075-4418/13/4/612/pdf

Reference38 articles.

1. Lupus Low Disease Activity State Achievement Is Important for Reducing Adverse Outcomes in Pregnant Patients with Systemic Lupus Erythematosus;Kim;J. Rheumatol.,2021

2. Predictive factors of fetal and maternal pregnancy outcomes in Japanese patients with systemic lupus erythematosus;Irino;Lupus,2021

3. Sample size for binary logistic prediction models: Beyond events per variable criteria;Moons;Stat. Methods Med. Res.,2019