Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study-Reference-Cited by-同舟云学术

Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study

Published:2020-10-06 Issue:10 Volume:22 Page:e21299
ISSN:1438-8871
Container-title:Journal of Medical Internet Research
language:en
Short-container-title:J Med Internet Res

Author:

Munsch Nicolas^ORCID,Martin Alistair^ORCID,Gruarin Stefanie^ORCID,Nateqi Jama^ORCID,Abdarahmane Isselmou^ORCID,Weingartner-Ortner Rafael^ORCID,Knapp Bernhard^ORCID

Abstract

Background A large number of web-based COVID-19 symptom checkers and chatbots have been developed; however, anecdotal evidence suggests that their conclusions are highly variable. To our knowledge, no study has evaluated the accuracy of COVID-19 symptom checkers in a statistically rigorous manner. Objective The aim of this study is to evaluate and compare the diagnostic accuracies of web-based COVID-19 symptom checkers. Methods We identified 10 web-based COVID-19 symptom checkers, all of which were included in the study. We evaluated the COVID-19 symptom checkers by assessing 50 COVID-19 case reports alongside 410 non–COVID-19 control cases. A bootstrapping method was used to counter the unbalanced sample sizes and obtain confidence intervals (CIs). Results are reported as sensitivity, specificity, F1 score, and Matthews correlation coefficient (MCC). Results The classification task between COVID-19–positive and COVID-19–negative for “high risk” cases among the 460 test cases yielded (sorted by F1 score): Symptoma (F1=0.92, MCC=0.85), Infermedica (F1=0.80, MCC=0.61), US Centers for Disease Control and Prevention (CDC) (F1=0.71, MCC=0.30), Babylon (F1=0.70, MCC=0.29), Cleveland Clinic (F1=0.40, MCC=0.07), Providence (F1=0.40, MCC=0.05), Apple (F1=0.29, MCC=-0.10), Docyet (F1=0.27, MCC=0.29), Ada (F1=0.24, MCC=0.27) and Your.MD (F1=0.24, MCC=0.27). For “high risk” and “medium risk” combined the performance was: Symptoma (F1=0.91, MCC=0.83) Infermedica (F1=0.80, MCC=0.61), Cleveland Clinic (F1=0.76, MCC=0.47), Providence (F1=0.75, MCC=0.45), Your.MD (F1=0.72, MCC=0.33), CDC (F1=0.71, MCC=0.30), Babylon (F1=0.70, MCC=0.29), Apple (F1=0.70, MCC=0.25), Ada (F1=0.42, MCC=0.03), and Docyet (F1=0.27, MCC=0.29). Conclusions We found that the number of correctly assessed COVID-19 and control cases varies considerably between symptom checkers, with different symptom checkers showing different strengths with respect to sensitivity and specificity. A good balance between sensitivity and specificity was only achieved by two symptom checkers.

Publisher

JMIR Publications Inc.

Subject

Health Informatics

Reference29 articles.

1. Impact of Rumors and Misinformation on COVID-19 in Social Media

2. Evaluation of symptom checkers for self diagnosis and triage: audit study

3. Digital and online symptom checkers and assessment services for urgent care to inform a new digital platform: a systematic review

4. Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: a systematic review and meta-analysis

5. Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period

Cited by 63 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comparison of Two Symptom Checkers (Ada and Symptoma) in the Emergency Department: Randomized, Crossover, Head-to-Head, Double-Blinded Study;Journal of Medical Internet Research;2024-08-20

2. Evaluating the Diagnostic Performance of Symptom Checkers: Clinical Vignette Study;JMIR AI;2024-04-29

3. Multilingual Framework for Risk Assessment and Symptom Tracking (MRAST);Sensors;2024-02-08

4. Comparison of Two Symptom Checkers (Ada and Symptoma) in the Emergency Department: Randomized, Crossover, Head-to-Head, Double-Blinded Study (Preprint);2024-01-19

5. Assessment of a Digital Symptom Checker Tool's Accuracy in Suggesting Reproductive Health Conditions: Clinical Vignettes Study;JMIR mHealth and uHealth;2023-12-05