Assessing Machine Learning classifiers in COVID-19: The Role of Clinical, Laboratory, and Radiological Features in Predicting Oxygen Saturation

Author:

Shahidzade Mostafa1ORCID,Jafari Ramezan2,Jafari Nematollah Jonaidi3,Salmanizadegan Fateme1,Teymouri Omid4,Sabouri Maryam1,Yargholi Mahya1,Mollaahmadipour Zahra1

Affiliation:

1. Student Research Committee, Baqiyatallah University of Medical Sciences, Tehran, Iran

2. Department of Radiology, Health Research Center, Life Style Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran

3. Specialist in Infectious Diseases, Health Research Center, Life Style Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran

4. Department of Radiology, Faculty of Medical Sciences, Iran University of Medical Sciences, Tehran, Iran

Abstract

Abstract

Background Oxygen saturation is vital for evaluating COVID-19 severity in hospitalized patients, with levels below 90% indicating respiratory distress and a potential need for intensive care. Objective This study develops machine learning models that integrate CT-based features with clinical and laboratory data to predict binary oxygen saturation outcomes in COVID-19 patients. Method A retrospective study of 1008 COVID-19 patients admitted between October 2020 and May 2021, using 70% of data for training and 30% for testing. Classifiers used: Linear SVM, SVM with RBF kernels, Logistic Regression, Random Forests, Naïve Bayes, and XGBoost. Performance assessed by validation AUC and 10-fold cross-validation AUC range. Significant features identified by the top validation AUC classifier, prioritizing the top three with importance and stability scores over 0.7. Results Linear ML classifiers performed well in Clinical and Laboratory Models, while non-linear classifiers excelled in CT-Based and Integrated Models. Logistic Regression in the Clinical Model achieved an AUC of 0.82, with Age, Gender, and Fever as significant features. In the Laboratory Model, Linear SVM (0.82) identified White Blood Cell count as key. Random Forest in the CT-Based Model (0.87) highlighted Mean Lesion Volume. The Integrated Model's top classifier, SVM with RBF Kernel (0.89), found WBC and Mean NLLV critical. Conclusion Linear classifiers effectively predict oxygen saturation using clinical and laboratory data, while non-linear classifiers excel with CT-based and integrated models, highlighting the need for tailored machine learning approaches to different data types in COVID-19 patient care.

Publisher

Springer Science and Business Media LLC

Reference22 articles.

1. Prediction models for respiratory outcomes in patients with COVID-19: integration of quantitative computed tomography parameters, demographics, and laboratory features;Kang J;J Thorac Dis,2023

2. Clinical and chest CT features as a predictive tool for COVID-19 clinical progress: introducing a novel semi-quantitative scoring system;Salahshour F;Eur Radiol,2021

3. Clinical and radiological imaging as prognostic predictors in COVID-19 patients;Metwally M;Egypt J Radiol Nuclear Med,2021

4. Computed tomography severity score as a predictor of disease severity and mortality in COVID-19 patients: A systematic review and meta-analysis;Prakash J;J Med Imaging Radiat Sci,2023

5. Relationship of Computed Tomography Severity Score With Patient Characteristics and Survival in Hypoxemic COVID-19 Patients;Yanamandra U;Cureus,2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3