Sounds of COVID-19: exploring realistic performance of audio-based digital testing-Reference-Cited by-同舟云学术

Sounds of COVID-19: exploring realistic performance of audio-based digital testing

Published:2022-01-28 Issue:1 Volume:5 Page:
ISSN:2398-6352
Container-title:npj Digital Medicine
language:en
Short-container-title:npj Digit. Med.

Author:

Han Jing^ORCID,Xia Tong^ORCID,Spathis Dimitris^ORCID,Bondareva Erika,Brown Chloë,Chauhan Jagmohan,Dang Ting,Grammenos Andreas,Hasthanasombat Apinan,Floto Andres,Cicuta Pietro^ORCID,Mascolo Cecilia

Abstract

AbstractTo identify Coronavirus disease (COVID-19) cases efficiently, affordably, and at scale, recent work has shown how audio (including cough, breathing and voice) based approaches can be used for testing. However, there is a lack of exploration of how biases and methodological decisions impact these tools’ performance in practice. In this paper, we explore the realistic performance of audio-based digital testing of COVID-19. To investigate this, we collected a large crowdsourced respiratory audio dataset through a mobile app, alongside symptoms and COVID-19 test results. Within the collected dataset, we selected 5240 samples from 2478 English-speaking participants and split them into participant-independent sets for model development and validation. In addition to controlling the language, we also balanced demographics for model training to avoid potential acoustic bias. We used these audio samples to construct an audio-based COVID-19 prediction model. The unbiased model took features extracted from breathing, coughs and voice signals as predictors and yielded an AUC-ROC of 0.71 (95% CI: 0.65–0.77). We further explored several scenarios with different types of unbalanced data distributions to demonstrate how biases and participant splits affect the performance. With these different, but less appropriate, evaluation strategies, the performance could be overestimated, reaching an AUC up to 0.90 (95% CI: 0.85–0.95) in some circumstances. We found that an unrealistic experimental setting can result in misleading, sometimes over-optimistic, performance. Instead, we reported complete and reliable results on crowd-sourced data, which would allow medical professionals and policy makers to accurately assess the value of this technology and facilitate its deployment.

Publisher

Springer Science and Business Media LLC

Subject

Health Information Management,Health Informatics,Computer Science Applications,Medicine (miscellaneous)

Link

https://www.nature.com/articles/s41746-021-00553-x.pdf

Reference38 articles.

1. Cevik, M., Kuppalli, K., Kindrachuk, J. & Peiris, M. Virology, transmission, and pathogenesis of SARS-CoV-2. Br. Med. J. 371, 1–6 (2020).

2. Vogels, C. B. et al. Analytical sensitivity and efficiency comparisons of SARS-CoV-2 RT–qPCR primer–probe sets. Nat. Microbiol. 5, 1299–1305 (2020).

3. Garg, A. et al. Evaluation of seven commercial RT-PCR kits for COVID-19 testing in pooled clinical specimens. J. Med. Virol. https://doi.org/10.1002/jmv.26691 (2020).

4. Liu, R. et al. Positive rate of RT-PCR detection of SARS-CoV-2 infection in 4880 cases from one hospital in Wuhan, China, from Jan to Feb 2020. Clin. Chim. Acta 505, 172–175 (2020).

5. van Kasteren, P. B. et al. Comparison of seven commercial RT-PCR diagnostic kits for COVID-19. J. Clin. Virol. https://doi.org/10.1016/j.jcv.2020.104412 (2020).

Cited by 56 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Digital voice-based biomarker for monitoring respiratory quality of life: findings from the colive voice study;Biomedical Signal Processing and Control;2024-10

2. Novel audio characteristic-dependent feature extraction and data augmentation methods for cough-based respiratory disease classification;Computers in Biology and Medicine;2024-09

3. Developing a multi-variate prediction model for COVID-19 from crowd-sourced respiratory voice data;Exploration of Digital Health Technologies;2024-08-11

4. Enhancing Water-Deficient Potato Plant Identification: Assessing Realistic Performance of Attention-Based Deep Neural Networks and Hyperspectral Imaging for Agricultural Applications;Plants;2024-07-11

5. CoBrS: Cough Breath Segmentation for the Reduction of Class-Confounding Characteristics in Dataset Curation;2024 IEEE International Symposium on Medical Measurements and Applications (MeMeA);2024-06-26