Calibration of medical diagnostic classifier scores to the probability of disease-Reference-Cited by-同舟云学术

Calibration of medical diagnostic classifier scores to the probability of disease

Published:2016-08-08 Issue:5 Volume:27 Page:1394-1409
ISSN:0962-2802
Container-title:Statistical Methods in Medical Research
language:en
Short-container-title:Stat Methods Med Res

Author:

Chen Weijie¹,Sahiner Berkman¹,Samuelson Frank¹,Pezeshk Aria¹,Petrick Nicholas¹

Affiliation:

1. Office of Science and Engineering Laboratories, Center for Devices and Radiological Health, Food and Drug Administration, Silver Spring, USA

Abstract

Scores produced by statistical classifiers in many clinical decision support systems and other medical diagnostic devices are generally on an arbitrary scale, so the clinical meaning of these scores is unclear. Calibration of classifier scores to a meaningful scale such as the probability of disease is potentially useful when such scores are used by a physician. In this work, we investigated three methods (parametric, semi-parametric, and non-parametric) for calibrating classifier scores to the probability of disease scale and developed uncertainty estimation techniques for these methods. We showed that classifier scores on arbitrary scales can be calibrated to the probability of disease scale without affecting their discrimination performance. With a finite dataset to train the calibration function, it is important to accompany the probability estimate with its confidence interval. Our simulations indicate that, when a dataset used for finding the transformation for calibration is also used for estimating the performance of calibration, the resubstitution bias exists for a performance metric involving the truth states in evaluating the calibration performance. However, the bias is small for the parametric and semi-parametric methods when the sample size is moderate to large (>100 per class).

Publisher

SAGE Publications

Subject

Health Information Management,Statistics and Probability,Epidemiology

Link

http://journals.sagepub.com/doi/pdf/10.1177/0962280216661371

Reference37 articles.

1. Computer-aided diagnosis in medical imaging: Historical review, current status and future potential

2. The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models

3. Projecting Individualized Probabilities of Developing Breast Cancer for White Females Who Are Being Examined Annually

4. Score normalization in multimodal biometric systems

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Calibrating machine learning approaches for probability estimation: A comprehensive comparison;Statistics in Medicine;2023-10-17

2. Machine Learning Risk Prediction Model of 90-day Mortality After Gastrectomy for Cancer;Annals of Surgery;2022-07-22

3. Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare;npj Digital Medicine;2022-05-31

4. Generating diagnostic profiles of cognitive decline and dementia using magnetoencephalography;Neurobiology of Aging;2022-03

5. Machine Learning Testing: Survey, Landscapes and Horizons;IEEE Transactions on Software Engineering;2022-01-01