Large-scale identification of patients with cerebral aneurysms using natural language processing-Reference-Cited by-同舟云学术

Large-scale identification of patients with cerebral aneurysms using natural language processing

Published:2016-12-07 Issue:2 Volume:88 Page:164-168
ISSN:0028-3878
Container-title:Neurology
language:en
Short-container-title:Neurology

Author:

Castro Victor M.,Dligach Dmitriy,Finan Sean,Yu Sheng,Can Anil,Abd-El-Barr Muhammad,Gainer Vivian,Shadick Nancy A.,Murphy Shawn,Cai Tianxi,Savova Guergana,Weiss Scott T.,Du Rose

Abstract

Objective:To use natural language processing (NLP) in conjunction with the electronic medical record (EMR) to accurately identify patients with cerebral aneurysms and their matched controls.Methods:ICD-9 and Current Procedural Terminology codes were used to obtain an initial data mart of potential aneurysm patients from the EMR. NLP was then used to train a classification algorithm with .632 bootstrap cross-validation used for correction of overfitting bias. The classification rule was then applied to the full data mart. Additional validation was performed on 300 patients classified as having aneurysms. Controls were obtained by matching age, sex, race, and healthcare use.Results:We identified 55,675 patients of 4.2 million patients with ICD-9 and Current Procedural Terminology codes consistent with cerebral aneurysms. Of those, 16,823 patients had the term aneurysm occur near relevant anatomic terms. After training, a final algorithm consisting of 8 coded and 14 NLP variables was selected, yielding an overall area under the receiver-operating characteristic curve of 0.95. After the final algorithm was applied, 5,589 patients were classified as having aneurysms, and 54,952 controls were matched to those patients. The positive predictive value based on a validation cohort of 300 patients was 0.86.Conclusions:We harnessed the power of the EMR by applying NLP to obtain a large cohort of patients with intracranial aneurysms and their matched controls. Such algorithms can be generalized to other diseases for epidemiologic and genetic studies.

Publisher

Ovid Technologies (Wolters Kluwer Health)

Subject

Neurology (clinical)

Reference15 articles.

1. Prevalence of unruptured intracranial aneurysms, with emphasis on sex, age, comorbidity, country, and time period: a systematic review and meta-analysis

2. Identification of subjects with polycystic ovary syndrome using electronic health records

3. Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system

4. Validation of Electronic Health Record Phenotyping of Bipolar Disorder Cases and Controls

5. Nalichowski R , Keogh D , Chueh HC , Murphy SN . Calculating the benefits of a research patient data repository. AMIA Annu Symp Proc 2006:1044.

Cited by 85 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Family history as the strongest predictor of aortic and peripheral aneurysms in patients with intracranial aneurysms;Journal of Clinical Neuroscience;2024-08

2. The Strategic Efficacy of Artificial Intelligence (AI) in Medical Tourism;Advances in Hospitality, Tourism, and the Services Industry;2024-05-10

3. Global trend in research of intracranial aneurysm management with artificial intelligence technology: a bibliometric analysis;Quantitative Imaging in Medicine and Surgery;2024-01

4. An Operative Medical Angio-Image Improvement with Adaptive Fractional Differential Filter;SN Computer Science;2023-11-03

5. Development and external validation of multimodal postoperative acute kidney injury risk machine learning models;JAMIA Open;2023-10-04