Natural Language Processing to Ascertain Cancer Outcomes From Medical Oncologist Notes-Reference-Cited by-同舟云学术

Natural Language Processing to Ascertain Cancer Outcomes From Medical Oncologist Notes

Published:2020-09 Issue:4 Volume: Page:680-690
ISSN:2473-4276
Container-title:JCO Clinical Cancer Informatics
language:en
Short-container-title:JCO Clinical Cancer Informatics

Author:

Kehl Kenneth L.¹²,Xu Wenxin²³,Lepisto Eva¹²,Elmarakeby Haitham¹²⁴,Hassett Michael J.¹²,Van Allen Eliezer M.¹²⁴,Johnson Bruce E.¹²,Schrag Deborah¹²

Affiliation:

1. Dana-Farber Cancer Institute, Boston, MA

2. Harvard Medical School, Boston, MA

3. Beth Israel Deaconess Medical Center, Boston, MA

4. The Broad Institute, Cambridge, MA

Abstract

PURPOSE Cancer research using electronic health records and genomic data sets requires clinical outcomes data, which may be recorded only in unstructured text by treating oncologists. Natural language processing (NLP) could substantially accelerate extraction of this information. METHODS Patients with lung cancer who had tumor sequencing as part of a single-institution precision oncology study from 2013 to 2018 were identified. Medical oncologists’ progress notes for these patients were reviewed. For each note, curators recorded whether the assessment/plan indicated any cancer, progression/worsening of disease, and/or response to therapy or improving disease. Next, a recurrent neural network was trained using unlabeled notes to extract the assessment/plan from each note. Finally, convolutional neural networks were trained on labeled assessments/plans to predict the probability that each curated outcome was present. Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC) among a held-out test set of 10% of patients. Associations between curated response or progression end points and overall survival were measured using Cox models among patients receiving palliative-intent systemic therapy. RESULTS Medical oncologist notes (n = 7,597) were manually curated for 919 patients. In the 10% test set, NLP models replicated human curation with AUROCs of 0.94 for the any-cancer outcome, 0.86 for the progression outcome, and 0.90 for the response outcome. Progression/worsening events identified using NLP models were associated with shortened survival (hazard ratio [HR] for mortality, 2.49; 95% CI, 2.00 to 3.09); response/improvement events were associated with improved survival (HR, 0.45; 95% CI, 0.30 to 0.67). CONCLUSION NLP models based on neural networks can extract meaningful outcomes from oncologist notes at scale. Such models may facilitate identification of clinical and genomic features associated with response to cancer treatment.

Publisher

American Society of Clinical Oncology (ASCO)

Subject

General Medicine

Link

https://ascopubs.org/doi/pdfdirect/10.1200/CCI.20.00020

Reference20 articles.

1. Real-World Evidence — What Is It and What Can It Tell Us?

2. The Evolving Uses of “Real-World” Data

3. Race, Poverty, and Initial Implementation of Precision Medicine for Lung Cancer

4. Schrag D: GENIE: Real-world application. Presented at the 2018 ASCO Annual Meeting, Chicago, IL, June 1-5, 2018

Cited by 49 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Digital-Focused Approaches in Cancer Patients’ Management in the Post-COVID Era: Challenges and Solutions;Applied Sciences;2024-09-10

2. Defining the quality of interdisciplinary care for patients with brain metastases: modified Delphi panel recommendations;The Lancet Oncology;2024-09

3. Symptom-BERT: Enhancing Cancer Symptom Detection in EHR Clinical Notes;Journal of Pain and Symptom Management;2024-08

4. Potential application of artificial intelligence in cancer therapy;Current Opinion in Oncology;2024-06-24

5. Analysis of Data Extraction Techniques on Medical Health Records;2024 International Conference on Electronics, Computing, Communication and Control Technology (ICECCC);2024-05-02