Assessing the Performance of Machine Learning Methods Trained on Public Health Observational Data: A Case Study From COVID‐19

Author:

Pigoli Davide12ORCID,Baker Kieran12,Budd Jobie3,Butler Lorraine4,Coppock Harry25,Egglestone Sabrina4,Gilmour Steven G.12,Holmes Chris26ORCID,Hurley David4,Jersakova Radka2,Kiskin Ivan7,Koutra Vasiliki12,Mellor Jonathon4,Nicholson George26,Packham Joe4,Patel Selina34,Payne Richard4,Roberts Stephen J.8,Schuller Björn W.25ORCID,Tendero‐Cañadas Ana49,Thornley Tracey10ORCID,Titcomb Alexander4

Affiliation:

1. Department of Mathematics King's College London UK

2. The Alan Turing Institute London UK

3. Division of Medicine University College London UK

4. UK Health Security Agency London UK

5. Group on Language Audio & Music Imperial College London UK

6. Department of Statistics University of Oxford UK

7. Centre for Vision, Speech and Signal Processing University of Surrey UK

8. Department of Engineering Science University of Oxford UK

9. Centre for Lifelong Health University of Brighton UK

10. Pharmacy Practice and Policy Division University of Nottingham UK

Abstract

ABSTRACTFrom early in the coronavirus disease 2019 (COVID‐19) pandemic, there was interest in using machine learning methods to predict COVID‐19 infection status based on vocal audio signals, for example, cough recordings. However, early studies had limitations in terms of data collection and of how the performances of the proposed predictive models were assessed. This article describes how these limitations have been overcome in a study carried out by the Turing‐RSS Health Data Laboratory and the UK Health Security Agency. As part of the study, the UK Health Security Agency collected a dataset of acoustic recordings, SARS‐CoV‐2 infection status and extensive study participant meta‐data. This allowed us to rigorously assess state‐of‐the‐art machine learning techniques to predict SARS‐CoV‐2 infection status based on vocal audio signals. The lessons learned from this project should inform future studies on statistical evaluation methods to assess the performance of machine learning techniques for public health tasks.

Funder

Royal Statistical Society

Alan Turing Institute

NIHR Oxford Biomedical Research Centre

Publisher

Wiley

Reference18 articles.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3