Author:
Islam Humayera, Bartlett Gillian, Pierce Robert, Rao Praveen, Waitman Lemuel R., Song Xing
Abstract
In this study, we assess the capacity of the BERT (Bidirectional Encoder Representations from Transformers) framework to predict 12-month risk for major diabetic complications (retinopathy, nephropathy, neuropathy, and major adverse cardiovascular events, MACE) using a single-center EHR dataset. We introduce a task-oriented predictive (Top)-BERT architecture, an end-to-end training and evaluation framework that retains the sequential input structure, embedding layer, and encoder stacks inherent to BERT. This architecture trains and evaluates the model across multiple learning tasks simultaneously, improving its ability to learn from a limited amount of data. Our findings demonstrate that this approach can outperform both traditional pretraining-finetuning BERT models and conventional machine learning methods, offering a promising tool for early identification of patients at risk of diabetes-related complications. We also investigate how different temporal embedding strategies affect the model’s predictive capabilities, with simpler designs yielding better performance. The use of Integrated Gradients (IG) augments the explainability of our predictive models, yielding feature attributions that substantiate the clinical significance of this study. Finally, this study highlights the essential role of proactive symptom assessment and the management of comorbid conditions in preventing the advancement of complications in patients with diabetes.
Publisher
Cold Spring Harbor Laboratory