Abstract
AbstractIn the area of medical imaging, one of the factors that can negatively influence the performance of prediction algorithms is the limited number of observations for each class within a labeled dataset. Usually, in order to increase the samples, a second set of unlabeled images is used. However, this set adds two new problems (i) finding patient observations with different pathologies than those observed in the labeled data set and (ii) finding images belonging to a different distribution from the dataset used in the model training process. This way, merging datasets from different sources can have an adverse effect on the distribution of features. Encountering this type of data (better known as out-of-distribution data) within the deployment environments may also lead to varying degrees of performance degradation as can be seen in the different experimental results obtained. In this research, a study of the behavior of Feature Density is made, as a mathematical model for the estimation of predictive uncertainty in supervised classification algorithms, in order to improve the behavior when out-of-distribution data are presented in the dataset. The Feature Density method is based on the estimation of feature density by means of histogram calculation (or Probability Density Function). The advantage of this method over the baseline approach (Mahalanobis distance) is that it does not assume a Gaussian-type distribution of sample characteristics and serves to estimate the uncertainty. This work focuses on the binary classification of mammography X-ray images from three different datasets simulating the condition of a different degree of contamination with out-of-distribution sample. According to the obtained results, the performance of the proposed method depends directly on the architecture of the implemented neural network.
Funder
Ministerio de Ciencia, Innovación y Universidades
Junta de Andalucía
Universidad de Málaga
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Software
Reference26 articles.
1. Iliadis L, Magri L (2022) Special issue on deep learning modeling in real life: anomaly detection, biomedical, concept analysis, finance, image analysis, recommendation. Neural Comput Appl 34:19397–19400
2. Calderon-Ramirez S, Yang S, Moemeni A, Colreavy-Donnelly S, Elizondo DA, Oala L, Rodríguez-Capitán J, Jiménez-Navarro M, López-Rubio E, Molina-Cabello MA (2021) Improving uncertainty estimation with semi-supervised deep learning for Covid-19 detection using chest x-ray images. IEEE Access 9:85442–85454
3. Wild C, Weiderpass E, Stewart B (2020) World cancer report: cancer research for cancer prevention. International Agency for Research on Cancer, Lyon, France
4. Society AC, Society (2022) Breast cancer facts and figures 2022. American Cancer Society, Atlanta
5. Calderón Ramírez S, Murillo-Hernández D, Rojas-Salazar K, Elizondo D, Moemeni A, Molina-Cabello MA (2022) A real use case of semi-supervised learning for mammogram classification in a local clinic of Costa Rica. Med Biol Eng Comput 60(4):1159–1175
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献