Artificial intelligence-aided clinical annotation of a large multi-cancer genomic dataset

Published:2021-12 Issue:1 Volume:12
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Kehl Kenneth L., Xu Wenxin, Gusev Alexander, Bakouny Ziad, Choueiri Toni K., Riaz Irbaz Bin, Elmarakeby Haitham, Van Allen Eliezer M., Schrag Deborah

Abstract

To accelerate cancer research that correlates biomarkers with clinical endpoints, methods are needed to ascertain outcomes from electronic health records at scale. Here, we train deep natural language processing (NLP) models to extract outcomes for participants with any of 7 solid tumors in a precision oncology study. Outcomes are extracted from 305,151 imaging reports for 13,130 patients and 233,517 oncologist notes for 13,511 patients, including patients with 6 additional cancer types. NLP models recapitulate outcome annotation from these documents, including the presence of cancer, progression/worsening, response/improvement, and metastases, with excellent discrimination (AUROC > 0.90). Models generalize to cancers excluded from training and yield outcomes correlated with survival. Among patients receiving checkpoint inhibitors, we confirm that high tumor mutation burden is associated with superior progression-free survival ascertained using NLP. Here, we show that deep NLP can accelerate annotation of molecular cancer datasets with clinically meaningful endpoints to facilitate discovery.

Funder

Doris Duke Charitable Foundation

U.S. Department of Health & Human Services | NIH | National Cancer Institute

American Association for Cancer Research

Kohlberg Chair at Harvard Medical School Trust Family, Michael Brigham, and Loker Pinard Funds for Kidney Cancer Research, Dana-Farber Cancer Institute

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy; General Biochemistry, Genetics and Molecular Biology; General Chemistry

Link

https://www.nature.com/articles/s41467-021-27358-6.pdf

Reference: 32 articles.

1. Garraway, L. A., Verweij, J. & Ballman, K. V. Precision oncology: an overview. J. Clin. Oncol. 31, 1803–1805 (2013).

2. AACR Project GENIE Consortium. AACR Project GENIE: Powering Precision Medicine through an International Consortium. Cancer Discov. 7, 818–831 (2017).

3. Zehir, A. et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat. Med. 23, 703–713 (2017).

4. Sholl, L. M. et al. Institutional implementation of clinical tumor profiling on an unselected cancer population. JCI Insight 1, e87062 (2016).

5. Cancer Genome Atlas Research Network, Weinstein, J. N. et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat. Genet. 45, 1113–1120 (2013).

Cited by 16 articles.

1. Artificial intelligence across oncology specialties: current applications and emerging tools;BMJ Oncology;2024-01

2. Data-Driven Approaches in Healthcare: Challenges and Emerging Trends;Multidisciplinary Perspectives on Artificial Intelligence and the Law;2023-12-27

3. Overview of approaches to estimate real-world disease progression in lung cancer;JNCI Cancer Spectrum;2023-09-21

4. Empirical evaluation of language modeling to ascertain cancer outcomes from clinical text reports;BMC Bioinformatics;2023-09-02

5. Artificial intelligence-aided optical imaging for cancer theranostics;Seminars in Cancer Biology;2023-09