Improved Interpretability of Machine Learning Model Using Unsupervised Clustering: Predicting Time to First Treatment in Chronic Lymphocytic Leukemia-Reference-Cited by-同舟云学术

Improved Interpretability of Machine Learning Model Using Unsupervised Clustering: Predicting Time to First Treatment in Chronic Lymphocytic Leukemia

Published:2019-12 Issue:3 Volume: Page:1-11
ISSN:2473-4276
Container-title:JCO Clinical Cancer Informatics
language:en
Short-container-title:JCO Clinical Cancer Informatics

Author:

Chen David¹,Goyal Gaurav¹,Go Ronald S.¹,Parikh Sameer A.¹,Ngufor Che G.¹

Affiliation:

1. Mayo Clinic Rochester, Rochester, MN

Abstract

PURPOSE Time to event is an important aspect of clinical decision making. This is particularly true when diseases have highly heterogeneous presentations and prognoses, as in chronic lymphocytic lymphoma (CLL). Although machine learning methods can readily learn complex nonlinear relationships, many methods are criticized as inadequate because of limited interpretability. We propose using unsupervised clustering of the continuous output of machine learning models to provide discrete risk stratification for predicting time to first treatment in a cohort of patients with CLL. PATIENTS AND METHODS A total of 737 treatment-naïve patients with CLL diagnosed at Mayo Clinic were included in this study. We compared predictive abilities for two survival models (Cox proportional hazards and random survival forest) and four classification methods (logistic regression, support vector machines, random forest, and gradient boosting machine). Probability of treatment was then stratified. RESULTS Machine learning methods did not yield significantly more accurate predictions of time to first treatment. However, automated risk stratification provided by clustering was able to better differentiate patients who were at risk for treatment within 1 year than models developed using standard survival analysis techniques. CONCLUSION Clustering the posterior probabilities of machine learning models provides a way to better interpret machine learning models.

Publisher

American Society of Clinical Oncology (ASCO)

Subject

General Medicine

Link

https://ascopubs.org/doi/pdfdirect/10.1200/CCI.18.00137

Reference36 articles.

1. Machine Learning in Medicine

2. Risk estimation and risk prediction using machine-learning methods

3. Unintended Consequences of Machine Learning in Medicine

4. Chronic lymphocytic leukemia: 2017 update on diagnosis, risk stratification, and treatment

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Serum CD133-Associated Proteins Identified by Machine Learning Are Connected to Neural Development, Cancer Pathways, and 12-Month Survival in Glioblastoma;Cancers;2024-08-01

2. Machine learning in infectious diseases: potential applications and limitations;Annals of Medicine;2024-06-10

3. SPIN-PM: a consensus framework to evaluate the presence of spin in studies on prediction models;Journal of Clinical Epidemiology;2024-06

4. Systematic review finds “spin” practices and poor reporting standards in studies on machine learning-based prediction models;Journal of Clinical Epidemiology;2023-06

5. A Review of Artificial Intelligence Applications in Hematology Management: Current Practices and Future Prospects;Journal of Medical Internet Research;2022-07-12