Abstract
AbstractUnderstanding various subpopulations in chronic kidney disease can improve patient care and aid in developing treatments targeted to patients’ needs. Due to the general slow disease progression, electronic health records, which comprise a rich source of longitudinal real-world patient-level information, offer an approach for generating insights into disease. Here we apply the open-source ConvAE framework to train an unsupervised deep learning network using a real-world kidney disease cohort consisting of 2.2 million US patients from the OPTUM® EHR database. Numerical patient representations derived from ConvAE are used to derive disease subtypes, inform comorbidities and understand rare disease populations. To identify patients at high risk to develop end-stage kidney disease, we extend a validated algorithm classifying disease severity to hypothesize subpopulations of rapid chronic kidney disease progressors. We demonstrate that using a combination of data-driven methods offers a powerful exploratory approach to understand disease heterogeneity and identify high-risk patients who could be targeted for early therapeutic intervention to prevent end-stage kidney disease.
Publisher
Cold Spring Harbor Laboratory
Reference28 articles.
1. Allam, Ahmed , Stefan Feuerriegel , Michael Rebhan , and Michael Krauthammer . 2021. “Analyzing Patient Trajectories With Artificial Intelligence.” Journal of Medical Internet Research 23 (12). https://doi.org/10.2196/29812.
2. Mining the Electronic Health Record for Disease Knowledge;Methods in Molecular Biology (Clifton, N.J,2014
3. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding;NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference,2018
4. Dong, Guiying , Jianfeng Feng , Fengzhu Sun , Jingqi Chen , and Xing Ming Zhao . 2021. “A Global Overview of Genetically Interpretable Multimorbidities among Common Diseases in the UK Biobank.” Genome Medicine 13 (1). https://doi.org/10.1186/S13073-021-00927-6.