Abstract
ABSTRACTDeep learning (DL) and machine learning (ML) models trained on long-term patient trajectories held as medical codes in electronic health records (EHR) have the potential to improve disease prediction. Anticoagulant prescribing decisions in atrial fibrillation (AF) offer a use case where the benchmark stroke risk prediction tool (CHA2DS2-VASc) could be meaningfully improved by including more information from a patient’s medical history. In this study, we design and build the first DL and ML pipeline that uses the routinely updated, linked EHR data for 56 million people in England accessed via NHS Digital to predict first ischaemic stroke in people with AF, and as a secondary outcome, COVID-19 death. Our pipeline improves first stroke prediction in AF by 17% compared to CHA2DS2-VASc (0.61 (0.57-0.65) vs 0.52 (0.52-0.52) area under the receiver operating characteristics curves, 95% confidence interval) and provides a generalisable, opensource framework that other researchers and developers can build on.
Publisher
Cold Spring Harbor Laboratory
Reference35 articles.
1. Artificial intelligence in healthcare: past, present and future
2. Vaswani, A. et al. Attention Is All You Need. Adv. Neural Inf. Process. Syst. 2017-December, 5999–6009 (2017).
3. BEHRT: Transformer for Electronic Health Records;Sci. Reports,2020
4. Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction;npj Digit. Med,2021
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献