Author:
Cui Chaohua,Lan Jiaona,Lao Zhenxian,Xia Tianyu,Long Tonghua
Abstract
BackgroundRecurrence can worsen conditions and increase mortality in ICH patients. Predicting the recurrence risk and preventing or treating these patients is a rational strategy to improve outcomes potentially. A machine learning model with improved performance is necessary to predict recurrence.MethodsWe collected data from ICH patients in two hospitals for our retrospective training cohort and prospective testing cohort. The outcome was the recurrence within one year. We constructed logistic regression, support vector machine (SVM), decision trees, Voting Classifier, random forest, and XGBoost models for prediction.ResultsThe model included age, NIHSS score at discharge, hematoma volume at admission and discharge, PLT, AST, and CRP levels at admission, use of hypotensive drugs and history of stroke. In internal validation, logistic regression demonstrated an AUC of 0.89 and precision of 0.81, SVM showed an AUC of 0.93 and precision of 0.90, the random forest achieved an AUC of 0.95 and precision of 0.93, and XGBoost scored an AUC of 0.95 and precision of 0.92. In external validation, logistic regression achieved an AUC of 0.81 and precision of 0.79, SVM obtained an AUC of 0.87 and precision of 0.76, the random forest reached an AUC of 0.92 and precision of 0.86, and XGBoost recorded an AUC of 0.93 and precision of 0.91.ConclusionThe machine learning models performed better in predicting ICH recurrence than traditional statistical models. The XGBoost model demonstrated the best comprehensive performance for predicting ICH recurrence in the external testing cohort.