Machine Learning-Based COVID-19 Diagnosis by Demographic Characteristics and Clinical Data

Author:

Gorji Fatemeh,Shafiekhani Sajad,Namdar Peyman,Abdollahzade Sina,Rafiei Sima

Abstract

Introduction: To facilitate rapid and effective diagnosis of COVID-19, effective screening can alleviate the challenges facing healthcare systems. We aimed to develop a machine learning-based prediction of COVID-19 diagnosis and design a graphical user interface (GUI) to diagnose COVID-19 cases by recording their symptoms and demographic features. Methods: We imple-mented different classification models including support vector machine (SVM), Decision tree (DT), Naïve Bayes (NB) and K-nearest neighbor (KNN) to predict the result of COVID-19 test for individ-uals. We trained these models by data of 16973 individuals (90% of all individuals included in data gathering) and tested by 1885 individuals (10% of all individuals). Maximum relevance minimum redundancy (MRMR) algorithms used to score features for prediction of result of COVID-19 test. A user-friendly GUI was designed to predict COVID-19 test results in individuals. Results: Study re-sults revealed that coughing had the highest positive correlation with the positive results of COVID-19 test followed by the duration of having COVID-19 signs and symptoms, exposure to infected individuals, age, muscle pain, recent infection by COVID-19 virus, fever, respiratory distress, loss of smell or taste, nausea, anorexia, headache, vertigo, CT symptoms in lung scans, diabetes and hyper-tension. The values of accuracy, precision, recall, F1-score, specificity and area under receiver oper-ating curve (AUROC) of different classification models computed in different setting of features scored by MRMR algorithm. Finally, our designed GUI by receiving each of the 42 features and symptoms from the users and through selecting one of the SVM, KNN, Naïve Bayes and decision tree models, predict the result of COVID-19 test. The accuracy, AUROC and F1-score of SVM model as the best model for diagnosis of COVID-19 test were 0.7048 (95% CI: 0.6998, 0.7094), 0.7045 (95% CI: 0.7003, 0.7104) and 0.7157 (95% CI: 0.7043, 0.7194), respectively. Conclusion: In this study we implemented a machine learning approach to facilitate early clinical decision making during COVID-19 outbreak and provide a predictive model of COVID-19 diagnosis capable of categorizing populations in to infected and non-infected individuals the same as an efficient screening tool.

Publisher

MDPI AG

Subject

Pulmonary and Respiratory Medicine

Cited by 8 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3