Application of convex hull analysis for the evaluation of data heterogeneity between patient populations of different origin and implications of hospital bias in downstream machine-learning-based data processing: A comparison of 4 critical-care patient datasets-Reference-Cited by-同舟云学术

Application of convex hull analysis for the evaluation of data heterogeneity between patient populations of different origin and implications of hospital bias in downstream machine-learning-based data processing: A comparison of 4 critical-care patient datasets

Published:2022-10-31 Issue: Volume:5 Page:
ISSN:2624-909X
Container-title:Frontiers in Big Data
language:
Short-container-title:Front. Big Data

Author:

Sharafutdinov Konstantin,Bhat Jayesh S.,Fritsch Sebastian Johannes,Nikulina Kateryna,E. Samadi Moein,Polzin Richard,Mayer Hannah,Marx Gernot,Bickenbach Johannes,Schuppert Andreas

Abstract

Machine learning (ML) models are developed on a learning dataset covering only a small part of the data of interest. If model predictions are accurate for the learning dataset but fail for unseen data then generalization error is considered high. This problem manifests itself within all major sub-fields of ML but is especially relevant in medical applications. Clinical data structures, patient cohorts, and clinical protocols may be highly biased among hospitals such that sampling of representative learning datasets to learn ML models remains a challenge. As ML models exhibit poor predictive performance over data ranges sparsely or not covered by the learning dataset, in this study, we propose a novel method to assess their generalization capability among different hospitals based on the convex hull (CH) overlap between multivariate datasets. To reduce dimensionality effects, we used a two-step approach. First, CH analysis was applied to find mean CH coverage between each of the two datasets, resulting in an upper bound of the prediction range. Second, 4 types of ML models were trained to classify the origin of a dataset (i.e., from which hospital) and to estimate differences in datasets with respect to underlying distributions. To demonstrate the applicability of our method, we used 4 critical-care patient datasets from different hospitals in Germany and USA. We estimated the similarity of these populations and investigated whether ML models developed on one dataset can be reliably applied to another one. We show that the strongest drop in performance was associated with the poor intersection of convex hulls in the corresponding hospitals' datasets and with a high performance of ML methods for dataset discrimination. Hence, we suggest the application of our pipeline as a first tool to assess the transferability of trained models. We emphasize that datasets from different hospitals represent heterogeneous data sources, and the transfer from one database to another should be performed with utmost care to avoid implications during real-world applications of the developed models. Further research is needed to develop methods for the adaptation of ML models to new hospitals. In addition, more work should be aimed at the creation of gold-standard datasets that are large and diverse with data from varied application sites.

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Information Systems,Computer Science (miscellaneous)

Reference42 articles.

1. Deep learning for segmentation of brain tumors: Impact of cross-institutional training and testing;AlBadawy;Med. Phys.,2018

2. Deep learning algorithm predicts diabetic retinopathy progression in individual patients;Arcadu;NPJ Digit. Med.,2019

3. Acute respiratory distress syndrome: the Berlin Definition;Ranieri;JAMA,2012

4. Learning in high dimension always amounts to extrapolation;Balestriero;arXiv preprint arXiv:2110.09485,2021

5. External validation demonstrates limited clinical utility of the interpretable mortality prediction model for patients with COVID-19;Barish;Nat. Mach. Intell.,2021

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals;Scientific Reports;2024-03-08

2. Cross-Domain Feature learning and data augmentation for few-shot proxy development in oil industry;Applied Soft Computing;2023-12

3. Developing an Artificial Intelligence-Based Representation of a Virtual Patient Model for Real-Time Diagnosis of Acute Respiratory Distress Syndrome;Diagnostics;2023-06-17

4. Analysis of Chest X-ray for COVID-19 Diagnosis as a Use Case for an HPC-Enabled Data Analysis and Machine Learning Platform for Medical Diagnosis Support;Diagnostics;2023-01-20

5. Computational simulation of virtual patients reduces dataset bias and improves machine learning-based detection of ARDS from noisy heterogeneous ICU datasets;IEEE Open Journal of Engineering in Medicine and Biology;2023