Author:
Xu Ming,Ouyang Liu,Gao Yan,Chen Yuanfang,Yu Tingting,Li Qian,Sun Kai,Bao Forrest Sheng,Safarnejad Lida,Wen Jing,Jiang Chao,Chen Tianyang,Han Lei,Zhang Hengdong,Gao Yue,Yu Zhengmin,Liu Xiaowen,Yan Tianyu,Li Hebi,Robinson Patrick,Zhu Baoli,Liu Jie,Liu Yang,Zhang Zengli,Ge Yaorong,Chen Shi
Abstract
AbstractEffectively identifying COVID-19 patients using non-PCR clinical data is critical for the optimal clinical outcomes. Currently, there is a lack of comprehensive understanding of various biomedical features and appropriate technical approaches to accurately detecting COVID-19 patients. In this study, we recruited 214 confirmed COVID-19 patients in non-severe (NS) and 148 in severe (S) clinical type, 198 non-infected healthy (H) participants and 129 non-COVID viral pneumonia (V) patients. The participants’ clinical information (23 features), lab testing results (10 features), and thoracic CT scans upon admission were acquired as three input feature modalities. To enable late fusion of multimodality data, we developed a deep learning model to extract a 10-feature high-level representation of the CT scans. Exploratory analyses showed substantial differences of all features among the four classes. Three machine learning models (k-nearest neighbor kNN, random forest RF, and support vector machine SVM) were developed based on the 43 features combined from all three modalities to differentiate four classes (NS, S, V, and H) at once. All three models had high accuracy to differentiate the overall four classes (95.4%-97.7%) and each individual class (90.6%-99.9%). Multimodal features provided substantial performance gain from using any single feature modality. Compared to existing binary classification benchmarks often focusing on single feature modality, this study provided a novel and effective breakthrough for clinical applications. Findings and the analytical workflow can be used as clinical decision support for current COVID-19 and other clinical applications with high-dimensional multimodal biomedical features.One sentence summaryWe trained and validated late fusion deep learning-machine learning models to predict non-severe COVID-19, severe COVID-19, non-COVID viral infection, and healthy classes from clinical, lab testing, and CT scan features extracted from convolutional neural network and achieved predictive accuracy of > 96% to differentiate all four classes at once based on a large dataset of 689 participants.
Publisher
Cold Spring Harbor Laboratory
Reference52 articles.
1. World Health Organization. Coronavirus Disease 2019 (COVID-19) Situation Report 203, 10 Aug 2020 (2020). https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200810-covid-19-sitrep-203.pdf?sfvrsn=aa050308_2
2. National Health Commission of China. Novel coronavirus pneumonia diagnosis and treatment plan. 4 Mar 2020 (2020). http://www.nhc.gov.cn/yzygj/s7652m/202003/a31191442e29474b98bfed5579d5af95.shtml
3. Evolving status of the 2019 novel coronavirus infection: Proposal of conventional serologic assays for disease diagnosis and infection monitoring
4. Combination of RT‐qPCR testing and clinical features for diagnosis of COVID‐19 facilitates management of SARS‐CoV‐2 outbreak
5. Systematic review with meta-analysis of the accuracy of diagnostic tests for COVID-19
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献