Development and Validation of a Machine Learning Approach Leveraging Real-World Clinical Narratives as a Predictor of Survival in Advanced Cancer-Reference-Cited by-同舟云学术

Development and Validation of a Machine Learning Approach Leveraging Real-World Clinical Narratives as a Predictor of Survival in Advanced Cancer

Published:2022-10 Issue:6 Volume: Page:
ISSN:2473-4276
Container-title:JCO Clinical Cancer Informatics
language:en
Short-container-title:JCO Clinical Cancer Informatics

Author:

Lin Frank Po-Yen¹²³⁴^ORCID,Salih Osama S.M.³⁵,Scott Nina⁶^ORCID,Jameson Michael B.³⁶^ORCID,Epstein Richard J.⁴⁷⁸^ORCID

Affiliation:

1. Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, Australia

2. NHMRC Clinical Trials Centre, Sydney University, Camperdown, Australia

3. Department of Medical Oncology, Waikato Hospital, Hamilton, New Zealand

4. School of Clinical Medicine, University of New South Wales, Sydney, Australia

5. Auckland City Hospital, Auckland, New Zealand

6. Waikato Clinical Campus, University of Auckland, Hamilton, New Zealand

7. Cancer Research Division, Garvan Institute of Medical Research, Sydney, Australia

8. New Hope Cancer Centre, Beijing United Hospital, Beijing, China

Abstract

PURPOSE Predicting short-term mortality in patients with advanced cancer remains challenging. Whether digitalized clinical text can be used to build models to enhance survival prediction in this population is unclear. MATERIALS AND METHODS We conducted a single-centered retrospective cohort study in patients with advanced solid tumors. Clinical correspondence authored by oncologists at the first patient encounter was extracted from the electronic medical records. Machine learning (ML) models were trained using narratives from the derivation cohort, before being tested on a temporal validation cohort at the same site. Performance was benchmarked against Eastern Cooperative Oncology Group performance status (PS), comparing ML models alone (comparison 1) or in combination with PS (comparison 2), assessed by areas under receiver operating characteristic curves (AUCs) for predicting vital status at 11 time points from 2 to 52 weeks. RESULTS ML models were built on the derivation cohort (4,791 patients from 2001 to April 2017) and tested on the validation cohort of 726 patients (May 2017-June 2019). In 441 patients (61%) where clinical narratives were available and PS was documented, ML models outperformed the predictivity of PS (mean AUC improvement, 0.039, P < .001, comparison 1). Inclusion of both clinical text and PS in ML models resulted in further improvement in prediction accuracy over PS with a mean AUC improvement of 0.050 ( P < .001, comparison 2); the AUC was > 0.80 at all assessed time points for models incorporating clinical text. Exploratory analysis of oncologist's narratives revealed recurring descriptors correlating with survival, including referral patterns, mobility, physical functions, and concomitant medications. CONCLUSION Applying ML to oncologists' narratives with or without including patient's PS significantly improved survival prediction to 12 months, suggesting the utility of clinical text in building prognostic support tools.

Publisher

American Society of Clinical Oncology (ASCO)

Subject

General Medicine

Link

https://ascopubs.org/doi/pdfdirect/10.1200/CCI.22.00064

Reference37 articles.

1. Toxicity and response criteria of the Eastern Cooperative Oncology Group

2. Co-morbidity leads to altered treatment and worse survival of elderly patients with colorectal cancer

3. Racial and Ethnic Differences in Breast Cancer Survival: Mediating Effect of Tumor Characteristics and Sociodemographic and Treatment Factors

4. Economic downturns, universal health coverage, and cancer mortality in high-income and middle-income countries, 1990–2010: a longitudinal analysis

5. A systematic review of physicians' survival predictions in terminally ill cancer patients