Abstract
Objectives
To assess the performance bias caused by sampling data into training and test sets in a mammography radiomics study.
Methods
Mammograms from 700 women were used to study upstaging of ductal carcinoma in situ. The dataset was shuffled and split into training (n = 400) and test (n = 300) sets forty times. For each split, cross-validation was used for training, followed by assessment on the held-out test set. Regularized logistic regression and a support vector machine were used as the machine learning classifiers. For each split and classifier type, multiple models were created based on radiomics and/or clinical features.
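The abstract does not include code; the following is a minimal sketch of how such a repeated-split experiment could be implemented with scikit-learn, assuming a feature matrix X (radiomics and/or clinical features) and binary upstaging labels y are already available. The function name, hyperparameter grids, and the use of stratified splitting are illustrative assumptions, not the authors' exact protocol.

```python
# Illustrative sketch of the repeated shuffle-and-split experiment described above.
# Assumes X (features) and y (binary upstaging labels) are already extracted.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.metrics import roc_auc_score

def run_splits(X, y, n_splits=40, train_size=400, test_size=300):
    """Shuffle and split the data n_splits times, tune each classifier with
    cross-validation on the training set, then score the held-out test set."""
    results = []
    for seed in range(n_splits):
        X_tr, X_te, y_tr, y_te = train_test_split(
            X, y, train_size=train_size, test_size=test_size,
            stratify=y, random_state=seed)
        for name, clf, grid in [
            ("logreg", LogisticRegression(penalty="l2", max_iter=1000),
             {"C": [0.01, 0.1, 1, 10]}),
            ("svm", SVC(kernel="rbf", probability=True),
             {"C": [0.1, 1, 10], "gamma": ["scale", 0.01]}),
        ]:
            # Cross-validation on the training set selects the regularization strength.
            search = GridSearchCV(clf, grid, cv=5, scoring="roc_auc").fit(X_tr, y_tr)
            auc_train = search.best_score_
            auc_test = roc_auc_score(y_te, search.predict_proba(X_te)[:, 1])
            results.append((seed, name, auc_train, auc_test))
    return results
```

Comparing the spread of auc_train and auc_test across the forty seeds is what exposes the split-to-split performance variability the study reports.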
Results
Area under the curve (AUC) performance varied considerably across the different data splits (e.g., radiomics regression model: train 0.58–0.70, test 0.59–0.73). The regression models showed a tradeoff in which better training performance was accompanied by worse test performance, and vice versa. Cross-validation over all cases reduced this variability but required samples of 500+ cases to yield representative performance estimates.
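For the pooled cross-validation estimate mentioned above, a generic stratified k-fold sketch is shown below; the abstract does not specify the exact scheme, so the fold count and stratification are assumptions.

```python
# Hypothetical illustration of cross-validating over all cases instead of a
# single fixed train/test split, pooling every case into the AUC estimate.
from sklearn.model_selection import cross_val_score, StratifiedKFold

def pooled_cv_auc(clf, X, y, n_folds=5, seed=0):
    """Return mean and standard deviation of AUC over stratified folds."""
    cv = StratifiedKFold(n_splits=n_folds, shuffle=True, random_state=seed)
    scores = cross_val_score(clf, X, y, cv=cv, scoring="roc_auc")
    return scores.mean(), scores.std()
```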
Conclusions
In medical imaging, clinical datasets are often relatively small. Models built from different training sets may not be representative of the whole dataset. Depending on the selected data split and model, performance bias could lead to inappropriate conclusions that might influence the clinical significance of the findings.
Advances in knowledge
Performance bias can arise when models are tested on limited datasets. Optimal strategies for test set selection should be developed to ensure that study conclusions are appropriate.
Funder
National Cancer Institute
DOD Breast Cancer Research Program
Breast Cancer Research Foundation
Cancer Research UK and Dutch Cancer Society
Publisher
Public Library of Science (PLoS)