Abstract
Machine learning (ML) is increasingly applied to predict adverse postoperative outcomes in cardiac surgery. Commonly used ML models fail to translate to clinical practice due to absent model explainability, limited uncertainty quantification, and inflexibility to missing data. We aimed to develop and benchmark a novel ML approach, the uncertainty-aware attention network (UAN), to overcome these common limitations. Two Bayesian uncertainty quantification methods were tested: generalized variational inference (GVI) and a posterior network (PN). The UAN models were compared with an ensemble of XGBoost models and a Bayesian logistic regression (LR) model with imputation. The derivation datasets consisted of 153,932 surgery events from the Australian and New Zealand Society of Cardiac and Thoracic Surgeons (ANZSCTS) Cardiac Surgery Database. The external validation dataset consisted of 7,343 surgery events extracted from the Medical Information Mart for Intensive Care (MIMIC) III critical care dataset. The highest performing model on the external validation dataset was a UAN-GVI, with an area under the receiver operating characteristic curve (AUC) of 0.78 (0.01). Model performance improved on high-confidence samples, with an AUC of 0.81 (0.01). Confidence calibration for aleatoric uncertainty was excellent for all models. Calibration for epistemic uncertainty was more variable, with an ensemble of XGBoost models performing best, with an AUC of 0.84 (0.08). Epistemic uncertainty was improved using the PN approach compared with GVI. The UAN uses an interpretable and flexible deep learning approach to provide estimates of model uncertainty alongside state-of-the-art predictions. The model has been made freely available as an easy-to-use web application, demonstrating that, by designing uncertainty-aware models with innately explainable predictions, deep learning may become more suitable for routine clinical use.
Funder
National Health and Medical Research Council
National Health and Medical Research Council Principal Research Fellowship
Publisher
Public Library of Science (PLoS)