Development and Validation of a Model to Identify Critical Brain Injuries Using Natural Language Processing of Text Computed Tomography Reports-Reference-Cited by-同舟云学术

Development and Validation of a Model to Identify Critical Brain Injuries Using Natural Language Processing of Text Computed Tomography Reports

Published:2022-08-16 Issue:8 Volume:5 Page:e2227109
ISSN:2574-3805
Container-title:JAMA Network Open
language:en
Short-container-title:JAMA Netw Open

Author:

Torres-Lopez Victor M.¹,Rovenolt Grace E.¹,Olcese Angelo J.¹,Garcia Gabriella E.¹,Chacko Sarah M.¹,Robinson Amber¹,Gaiser Edward¹,Acosta Julian¹,Herman Alison L.¹,Kuohn Lindsey R.¹,Leary Megan¹,Soto Alexandria L.¹,Zhang Qiang¹,Fatima Safoora²,Falcone Guido J.¹,Payabvash Seyedmehdi³,Sharma Richa¹,Struck Aaron F.²⁴,Sheth Kevin N.¹,Westover M. Brandon⁵,Kim Jennifer A.¹

Affiliation:

1. Department of Neurology, Yale University, New Haven, Connecticut

2. Department of Neurology, University of Wisconsin, Madison

3. Department of Radiology, Yale University, New Haven, Connecticut

4. William S Middleton Veterans Hospital, Madison, Wisconsin

5. Department of Neurology, Massachusetts General Hospital, Boston

Abstract

ImportanceClinical text reports from head computed tomography (CT) represent rich, incompletely utilized information regarding acute brain injuries and neurologic outcomes. CT reports are unstructured; thus, extracting information at scale requires automated natural language processing (NLP). However, designing new NLP algorithms for each individual injury category is an unwieldy proposition. An NLP tool that summarizes all injuries in head CT reports would facilitate exploration of large data sets for clinical significance of neuroradiological findings.ObjectiveTo automatically extract acute brain pathological data and their features from head CT reports.Design, Setting, and ParticipantsThis diagnostic study developed a 2-part named entity recognition (NER) NLP model to extract and summarize data on acute brain injuries from head CT reports. The model, termed BrainNERD, extracts and summarizes detailed brain injury information for research applications. Model development included building and comparing 2 NER models using a custom dictionary of terms, including lesion type, location, size, and age, then designing a rule-based decoder using NER outputs to evaluate for the presence or absence of injury subtypes. BrainNERD was evaluated against independent test data sets of manually classified reports, including 2 external validation sets. The model was trained on head CT reports from 1152 patients generated by neuroradiologists at the Yale Acute Brain Injury Biorepository. External validation was conducted using reports from 2 outside institutions. Analyses were conducted from May 2020 to December 2021.Main Outcomes and MeasuresPerformance of the BrainNERD model was evaluated using precision, recall, and F1 scores based on manually labeled independent test data sets.ResultsA total of 1152 patients (mean [SD] age, 67.6 [16.1] years; 586 [52%] men), were included in the training set. NER training using transformer architecture and bidirectional encoder representations from transformers was significantly faster than spaCy. For all metrics, the 10-fold cross-validation performance was 93% to 99%. The final test performance metrics for the NER test data set were 98.82% (95% CI, 98.37%-98.93%) for precision, 98.81% (95% CI, 98.46%-99.06%) for recall, and 98.81% (95% CI, 98.40%-98.94%) for the F score. The expert review comparison metrics were 99.06% (95% CI, 97.89%-99.13%) for precision, 98.10% (95% CI, 97.93%-98.77%) for recall, and 98.57% (95% CI, 97.78%-99.10%) for the F score. The decoder test set metrics were 96.06% (95% CI, 95.01%-97.16%) for precision, 96.42% (95% CI, 94.50%-97.87%) for recall, and 96.18% (95% CI, 95.151%-97.16%) for the F score. Performance in external institution report validation including 1053 head CR reports was greater than 96%.Conclusions and RelevanceThese findings suggest that the BrainNERD model accurately extracted acute brain injury terms and their properties from head CT text reports. This freely available new tool could advance clinical research by integrating information in easily gathered head CT reports to expand knowledge of acute brain injury radiographic phenotypes.

Publisher

American Medical Association (AMA)

Subject

General Medicine

Link

https://jamanetwork.com/journals/jamanetworkopen/articlepdf/2795179/torreslopez_2022_oi_220765_1664286350.55909.pdf

Reference42 articles.

1. Heart disease and stroke statistics—2020 update: a report from the American Heart Association.;Virani;Circulation,2020

2. Performance of e-ASPECTS software in comparison to that of stroke physicians on assessing CT scans of acute ischemic stroke patients.;Herweh;Int J Stroke,2016

3. Automated detection, localization, and classification of traumatic vertebral body fractures in the thoracic and lumbar spine at CT.;Burns;Radiology,2016

4. Applications of radiomics in precision diagnosis, prognostication and treatment planning of head and neck squamous cell carcinomas.;Haider;Cancers Head Neck,2020

5. Radiomics: extracting more information from medical images using advanced feature analysis.;Lambin;Eur J Cancer,2012

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A scoping review of large language model based approaches for information extraction from radiology reports;npj Digital Medicine;2024-08-24

2. Bidirectional Encoder Representations from Transformers in Radiology: A Systematic Review of Natural Language Processing Applications;Journal of the American College of Radiology;2024-06

3. Data Extraction from Free-Text Reports on Mechanical Thrombectomy in Acute Ischemic Stroke Using ChatGPT: A Retrospective Analysis;Radiology;2024-04-01

4. Uncertainty-aware deep-learning model for prediction of supratentorial hematoma expansion from admission non-contrast head computed tomography scan;npj Digital Medicine;2024-02-06

5. Time-Dependent Changes in Hematoma Expansion Rate after Supratentorial Intracerebral Hemorrhage and Its Relationship with Neurological Deterioration and Functional Outcome;Diagnostics;2024-01-31