Abstract
AbstractClustering is the task of identifying groups of similar subjects according to certain criteria. The AJCC staging system can be thought as a clustering mechanism that groups patients based on their disease stage. This grouping drives prognosis and influences treatment. The goal of this work is to evaluate the efficacy of machine learning algorithms to cluster the patients into discriminative groups to improve prognosis for overall survival (OS) and relapse free survival (RFS) outcomes. We apply clustering over a retrospectively collected data from 644 head and neck cancer patients including both clinical and radiomic features. In order to incorporate outcome information into the clustering process and deal with the large proportion of censored samples, the feature space was scaled using the regression coefficients fitted using a proxy dependent variable, martingale residuals, instead of follow-up time. Two clusters were identified and evaluated using cross validation. The Kaplan Meier (KM) curves between the two clusters differ significantly for OS and RFS (p-value < 0.0001). Moreover, there was a relative predictive improvement when using the cluster label in addition to the clinical features compared to using only clinical features where AUC increased by 5.7% and 13.0% for OS and RFS, respectively.
Funder
U.S. Department of Health & Human Services | NIH | National Cancer Institute
National Science Foundation
Philanthropic donations from the Family of Paul W. Beach to Dr. G. Brandon Gunn, MD
Egyptian American conjoint PhD program funded by the Egyptian Cultural and Educational bureau
U.S. Department of Health & Human Services | National Institutes of Health
Feinberg Foundation
U.S. Department of Health & Human Services | NIH | National Institute of Dental and Craniofacial Research
NSF | Directorate for Mathematical & Physical Sciences | Division of Mathematical Sciences
Andrew Sabin Family Foundation (Sabin Family Foundation Fellow). Direct industry grant support and travel funding from Elekta AB.
Publisher
Springer Science and Business Media LLC
Reference51 articles.
1. Rahib, L. et al. Projecting cancer incidence and deaths to 2030: the unexpected burden of thyroid, liver, and pancreas cancers in the united states. Cancer research 74, 2913–2921 (2014).
2. On Cancers, T. A. J. C. Cancer staging system, https://cancerstaging.org/references-tools/Pages/What-is-Cancer-Staging.aspx. Online; accessed Sept (2017).
3. Castellano, G., Bonilha, L., Li, L. & Cendes, F. Texture analysis of medical images. Clin. radiology 59, 1061–1069 (2004).
4. Parmar, C. et al. Radiomic machine-learning classifiers for prognostic biomarkers of head and neck cancer. Front. oncology 5 (2015).
5. Leger, S. et al. A comparative study of machine learning methods for time-to-event survival data for radiomics risk modelling. Sci. Reports 7, 13206 (2017).