Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach-Reference-Cited by-同舟云学术

Classification Models for COVID-19 Test Prioritization in Brazil: Machine Learning Approach

Published:2021-04-08 Issue:4 Volume:23 Page:e27293
ISSN:1438-8871
Container-title:Journal of Medical Internet Research
language:en
Short-container-title:J Med Internet Res

Author:

Viana dos Santos Santana Íris^ORCID,CM da Silveira Andressa^ORCID,Sobrinho Álvaro^ORCID,Chaves e Silva Lenardo^ORCID,Dias da Silva Leandro^ORCID,Santos Danilo F S^ORCID,Gurjão Edmar C^ORCID,Perkusich Angelo^ORCID

Abstract

Background Controlling the COVID-19 outbreak in Brazil is a challenge due to the population’s size and urban density, inefficient maintenance of social distancing and testing strategies, and limited availability of testing resources. Objective The purpose of this study is to effectively prioritize patients who are symptomatic for testing to assist early COVID-19 detection in Brazil, addressing problems related to inefficient testing and control strategies. Methods Raw data from 55,676 Brazilians were preprocessed, and the chi-square test was used to confirm the relevance of the following features: gender, health professional, fever, sore throat, dyspnea, olfactory disorders, cough, coryza, taste disorders, and headache. Classification models were implemented relying on preprocessed data sets; supervised learning; and the algorithms multilayer perceptron (MLP), gradient boosting machine (GBM), decision tree (DT), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbors (KNN), support vector machine (SVM), and logistic regression (LR). The models’ performances were analyzed using 10-fold cross-validation, classification metrics, and the Friedman and Nemenyi statistical tests. The permutation feature importance method was applied for ranking the features used by the classification models with the highest performances. Results Gender, fever, and dyspnea were among the highest-ranked features used by the classification models. The comparative analysis presents MLP, GBM, DT, RF, XGBoost, and SVM as the highest performance models with similar results. KNN and LR were outperformed by the other algorithms. Applying the easy interpretability as an additional comparison criterion, the DT was considered the most suitable model. Conclusions The DT classification model can effectively (with a mean accuracy≥89.12%) assist COVID-19 test prioritization in Brazil. The model can be applied to recommend the prioritizing of a patient who is symptomatic for COVID-19 testing.

Publisher

JMIR Publications Inc.

Subject

Health Informatics

Reference42 articles.

1. Precision diagnosis: a view of the clinical decision support systems (CDSS) landscape through the lens of critical care

2. A hybrid model of Internet of Things and cloud computing to manage big data in health services applications

3. eHealth Initiatives for The Promotion of Healthy Lifestyle and Allied Implementation Difficulties

4. Machine learning-based prediction of COVID-19 diagnosis based on symptoms

5. Knowledge About COVID-19 in Brazil: Cross-Sectional Web-Based Study

Cited by 29 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine Learning Applied to the Analysis of Prolonged COVID Symptoms: An Analytical Review;Informatics;2024-07-18

2. A NOVEL COVID-19 CLASSIFICATION METHOD BASED ON CURE CLUSTERING;Scientific Journal of Mehmet Akif Ersoy University;2024-06-30

3. Radiomics models to predict bone marrow metastasis of neuroblastoma using CT;Cancer Innovation;2024-06-28

4. Coloured Petri Nets Modeling Multilayer Perceptron Neural Networks;2024 IEEE International Conference on Consumer Electronics (ICCE);2024-01-06

5. Severity prediction in COVID-19 patients using clinical markers and explainable artificial intelligence: A stacked ensemble machine learning approach;Intelligent Decision Technologies;2023-11-20