Assessing generalizability of a dengue classifier across multiple datasets

Author:

Lu Bingqian, Li Yanni, Evans Ciaran

Abstract

Background

Early diagnosis of dengue fever is important for individual treatment and monitoring disease prevalence in the population. To assist diagnosis, previous studies have proposed classification models to detect dengue from symptoms and clinical measurements. However, there has been little exploration of whether existing models can be used to make predictions for new populations.

Methods

We trained logistic regression models on five publicly available dengue datasets from previous studies, using three explanatory variables identified as important in prior work: age, white blood cell count, and platelet count. These five datasets were collected at different times in different locations, with a variety of disease rates and patient ages. A model was trained on each dataset, and predictive performance was evaluated on both the original (training) dataset, and the other (test) datasets from different studies.

Results

In-sample area under the receiver operating characteristic curve (AUC) values for the logistic regression models ranged from 0.74 to 0.89, while out-of-sample AUCs ranged from 0.55 to 0.89. Matching age ranges in training/test datasets increased AUC values and balanced the sensitivity and specificity. Adjusting the predicted probabilities to account for differences in dengue prevalence improved calibration in 20/28 training-test pairs.

Conclusions

The in-sample performance of the logistic regression model was consistent with previous dengue classifiers, suggesting the chosen model is a good choice in a variety of settings and has decent overall performance. However, adjustments are required to make predictions on new datasets. Practitioners can use existing dengue classifiers in new settings but should be careful with different patient ages and disease rates.

Author summary

Dengue fever is an acute mosquito-borne infection with a substantial and growing disease burden. Early diagnosis of dengue fever is important for treatment and disease monitoring, but gold-standard tests are not always readily available. To supplement diagnostic tests, previous studies have investigated the use of classifiers trained on common diagnostic measurements such as symptoms, blood work, and demographic variables. However, comparisons of existing methods are limited, and in particular there has been little exploration of whether existing models can be used to make predictions for new populations. Generalizability of existing models to new settings could save the substantial time and effort required to collect new data and fit a model, but this generalizability may be limited by differences between populations. In this study, we assess performance of logistic regression models on five publicly available dengue datasets from previous studies, using three explanatory variables identified as important in prior work: age, white blood cell count, and platelet count. Our results show that it can be possible for practitioners to use existing models in new settings, but care is needed to account for differences in patient demographics and dengue prevalence.
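The abstract does not say how predicted probabilities were adjusted for differences in dengue prevalence. As an illustration only, the sketch below uses the standard prior-shift correction, which rescales the predicted odds by the ratio of target to training prevalence; this may not be the authors' procedure. The function names `auc` and `adjust_for_prevalence` and all numbers in the usage note are hypothetical.

```python
def auc(labels, scores):
    """Area under the ROC curve by pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the
    positive case receives the higher score; ties count as one half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def adjust_for_prevalence(p, train_prev, target_prev):
    """Rescale a predicted probability for a new disease prevalence.

    Assumption (not stated in the source): the distribution of the
    predictors within each class is the same in both populations, and
    only the class proportions differ (prior shift).
    """
    if not (0.0 < train_prev < 1.0 and 0.0 < target_prev < 1.0):
        raise ValueError("prevalences must lie strictly between 0 and 1")
    if not (0.0 <= p <= 1.0):
        raise ValueError("p must be a probability")
    # Reweight the positive and negative class by the prevalence ratios.
    num = p * target_prev / train_prev
    den = num + (1.0 - p) * (1.0 - target_prev) / (1.0 - train_prev)
    return num / den
```

For example, a prediction of 0.5 from a model trained where half the patients had dengue becomes 0.2 under `adjust_for_prevalence(0.5, 0.5, 0.2)`. The correction is strictly increasing in `p`, so it leaves the ranking of patients, and therefore the AUC, unchanged. It can only affect calibration, which is the quantity the Results report as improved.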

Publisher

Cold Spring Harbor Laboratory
