Abstract
ABSTRACTObjectiveThis paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.Materials and MethodsThe method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date. We test our framework by evaluating one PA for systemic lupus erythematosus (SLE) and two PAs for Alzheimer’s disease (AD) across 10 different observational data sources.ResultsBy utilizing CohortDiagnostics, we found that the population-level characteristics of individuals in the cohort of SLE closely matched the disease’s anticipated clinical profile. Specifically, the incidence rate of SLE was consistently higher in occurrence among females. Moreover, expected clinical events like laboratory tests, treatments, and repeated diagnoses were also observed. For AD, although one PA identified considerably fewer patients, absence of notable differences in clinical characteristics between the two cohorts suggested similar specificity.DiscussionWe provide a practical and data-driven approach to evaluate PAs, using two clinical diseases as examples, across a network of OMOP data sources. Cohort Diagnostics can ensure the subjects identified by a specific PA align with those intended for inclusion in a research study.ConclusionDiagnostics based on large-scale population-level characterization can offer insights into the misclassification errors of PAs.
Publisher
Cold Spring Harbor Laboratory
Reference32 articles.
1. A collaborative approach to developing an electronic health record phenotyping algorithm for drug-induced liver injury
2. High-fidelity phenotyping: richness and freedom from bias;Journal of the American Medical Informatics Association,2017
3. Weaver, J. , et al. Best Practices for Creating the Standardized Content of an Entry in the OHDSI Phenotype Library. in 5th OHDSI Annual Symposium. 2019.
4. Chapman, M. , et al., Desiderata for the development of next-generation electronic health record phenotype libraries. GigaScience, 2021. 10(9): p. giab059.
5. Kuha, J. , C. Skinner , and J. Palmgren , Misclassification Error, in Encyclopedia of Biostatistics. 2005.