Electronic health record signatures identify undiagnosed patients with common variable immunodeficiency disease

Author:

Johnson Ruth12ORCID,Stephens Alexis V.3,Mester Rachel1ORCID,Knyazev Sergey2ORCID,Kohn Lisa A.3ORCID,Freund Malika K.4,Bondhus Leroy4ORCID,Hill Brian L.1ORCID,Schwarz Tommer5,Zaitlen Noah6ORCID,Arboleda Valerie A.247ORCID,A. Bastarache Lisa8,Pasaniuc Bogdan2457,Butte Manish J.349ORCID

Affiliation:

1. Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA.

2. Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA.

3. Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California Los Angeles, Los Angeles, CA 90095, USA.

4. Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA.

5. Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA.

6. Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095, USA.

7. Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA.

8. Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA 37203.

9. Department of Microbiology, Immunology, and Molecular Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA.

Abstract

Human inborn errors of immunity include rare disorders entailing functional and quantitative antibody deficiencies due to impaired B cells called the common variable immunodeficiency (CVID) phenotype. Patients with CVID face delayed diagnoses and treatments for 5 to 15 years after symptom onset because the disorders are rare (prevalence of ~1/25,000), and there is extensive heterogeneity in CVID phenotypes, ranging from infections to autoimmunity to inflammatory conditions, overlapping with other more common disorders. The prolonged diagnostic odyssey drives excessive system-wide costs before diagnosis. Because there is no single causal mechanism, there are no genetic tests to definitively diagnose CVID. Here, we present PheNet, a machine learning algorithm that identifies patients with CVID from their electronic health records (EHRs). PheNet learns phenotypic patterns from verified CVID cases and uses this knowledge to rank patients by likelihood of having CVID. PheNet could have diagnosed more than half of our patients with CVID 1 or more years earlier than they had been diagnosed. When applied to a large EHR dataset, followed by blinded chart review of the top 100 patients ranked by PheNet, we found that 74% were highly probable to have CVID. We externally validated PheNet using >6 million records from disparate medical systems in California and Tennessee. As artificial intelligence and machine learning make their way into health care, we show that algorithms such as PheNet can offer clinical benefits by expediting the diagnosis of rare diseases.

Publisher

American Association for the Advancement of Science (AAAS)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3