TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records-Reference-Cited by-同舟云学术

TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records

Published:2023-11-29 Issue:1 Volume:14 Page:
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Yang Zhichao^ORCID,Mitra Avijit,Liu Weisong,Berlowitz Dan,Yu Hong^ORCID

Abstract

AbstractDeep learning transformer-based models using longitudinal electronic health records (EHRs) have shown a great success in prediction of clinical diseases or outcomes. Pretraining on a large dataset can help such models map the input space better and boost their performance on relevant tasks through finetuning with limited data. In this study, we present TransformEHR, a generative encoder-decoder model with transformer that is pretrained using a new pretraining objective—predicting all diseases and outcomes of a patient at a future visit from previous visits. TransformEHR’s encoder-decoder framework, paired with the novel pretraining objective, helps it achieve the new state-of-the-art performance on multiple clinical prediction tasks. Comparing with the previous model, TransformEHR improves area under the precision–recall curve by 2% (p < 0.001) for pancreatic cancer onset and by 24% (p = 0.007) for intentional self-harm in patients with post-traumatic stress disorder. The high performance in predicting intentional self-harm shows the potential of TransformEHR in building effective clinical intervention systems. TransformEHR is also generalizable and can be easily finetuned for clinical prediction tasks with limited data.

Funder

U.S. Department of Health & Human Services | NIH | National Institute of Mental Health

U.S. Department of Health & Human Services | NIH | National Institute on Drug Abuse

U.S. Department of Health & Human Services | NIH | National Institute on Aging

U.S. Department of Veterans Affairs

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy,General Biochemistry, Genetics and Molecular Biology,General Chemistry,Multidisciplinary

Link

https://www.nature.com/articles/s41467-023-43715-z.pdf

Reference58 articles.

1. Kessler, R. C. et al. Using administrative data to predict suicide after psychiatric hospitalization in the veterans health administration system. Front. Psychiatry 11, 390 (2020).

2. Zhao, W., Jiang, W. & Qiu, X. Deep learning for COVID-19 detection based on CT images. Sci. Rep. 11, 14353 (2021).

3. Goh, K. H. et al. Artificial intelligence in sepsis early prediction and diagnosis using unstructured data in healthcare. Nat. Commun. 12, 711 (2021).

4. Wornow, M. et al. The shaky foundations of large language models and foundation models for electronic health records. NPJ Digit. Med. 6, 135 (2023).

5. Choi, E. et al. RETAIN: an interpretable predictive model for healthcare using reverse time attention mechanism. In 30th Annual Conference on Neural Information Processing Systems (NIPS 2016). Advances in Neural Information Processing Systems 3512–3520 (2016).

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Integrating multi-task and cost-sensitive learning for predicting mortality risk of chronic diseases in the elderly using real-world data;International Journal of Medical Informatics;2024-11

2. Self-attention with temporal prior: can we learn more from the arrow of time?;Frontiers in Artificial Intelligence;2024-08-06

3. Optimizing pain management in breast cancer care: Utilizing ‘All of Us’ data and deep learning to identify patients at elevated risk for chronic pain;Journal of Nursing Scholarship;2024-07-26

4. Enhancing Type 2 Diabetes Treatment Decisions With Interpretable Machine Learning Models for Predicting Hemoglobin A1c Changes: Machine Learning Model Development;JMIR AI;2024-07-18

5. Predicting Autism Spectrum Disorder: Transformer-Based Deep Learning Ensemble Framework Using Health Administrative & Birth Registry Data;2024-07-05