Affiliation:
1. Department of Radiation Oncology Amsterdam UMC Amsterdam The Netherlands
2. Faculty of Science Universiteit van Amsterdam Amsterdam The Netherlands
Abstract
AbstractBackgroundClinical data used to train deep learning models are often not clean data. They can contain imperfections in both the imaging data and the corresponding segmentations.PurposeThis study investigates the influence of data imperfections on the performance of deep learning models for parotid gland segmentation. This was done in a controlled manner by using synthesized data. The insights this study provides may be used to make deep learning models better and more reliable.MethodsThe data were synthesized by using the clinical segmentations, creating a pseudo ground‐truth in the process. Three kinds of imperfections were simulated: incorrect segmentations, low image contrast, and artifacts in the imaging data. The severity of each imperfection was varied in five levels. Models resulting from training sets from each of the five levels were cross‐evaluated with test sets from each of the five levels.ResultsUsing synthesized data led to almost perfect parotid gland segmentation when no error was added. Lowering the quality of the parotid gland segmentations used for training substantially lowered the model performance. Additionally, lowering the image quality of the training data by decreasing the contrast or introducing artifacts made the resulting models more robust to data containing those respective kinds of data imperfection.ConclusionThis study demonstrated the importance of good‐quality segmentations for deep learning training and it shows that using low‐quality imaging data for training can enhance the robustness of the resulting models.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献