Machine Learning Based Predictive Model of 5-Year Survival in Multiple Myeloma Autologous Transplant Patients

Author:

Chen Robert1,Garapati Sankeerth1,Wu David12,Ko Sean13,Falk Sam14,Dierov Djamilia1,Stasiw Adam1,Opong Adomah S.15,Carson Kenneth R.671

Affiliation:

1. Flatiron Health Inc, New York, NY

2. Stanford University, Stanford, CA

3. Massachusetts Institute of Technology, Cambridge, MA

4. New York University, New York, NY

5. Weill Cornell Medicine, New York, NY

6. Washington University School of Medicine in St. Louis, Saint Louis, MO

7. St. Louis Veterans Affairs Medical Center, Saint Louis, MO

Abstract

Introduction: With recent improvements in therapy, median survival after diagnosis for patients with Multiple Myeloma (MM) now exceeds 5 years. However, for a variety of reasons most patients do not receive the most intensive therapy for MM, namely autologous stem cell transplantation (auto-SCT). In order to better understand which patients will receive the greatest benefit from auto-SCT, we developed a machine learning (ML) model to identify relevant predictors of 5-year mortality for MM patients who received auto-SCT. Previous studies have identified adverse cytogenetics (e.g., chromosome 13 deletion), elevated B2-microglobulin, elevated lactate dehydrogenase, and the receipt of more than 1 year of standard chemotherapy as prognostic risk factors. Using several ML techniques, we built a predictive model for 5-year mortality post-transplant and identified features that were most predictive. Methods: We used the de-identified nationwide Flatiron Health electronic health record (EHR)-derived database. Patients with a confirmed MM diagnosis between January 1, 2011 and May 31, 2019, and who received an auto-SCT at any date after their diagnosis, were eligible for analysis. Clinically relevant data available for these patients included demographic information, lab results, date of transplant, medications administered before transplant, lines of therapy, M-spike results, time from diagnosis to auto-SCT, and mortality data. Patients were classified based on 5-year survival after auto-SCT. Patients who underwent auto-SCT within the last 5 years and are still alive were excluded from this analysis, as were those who received an allogeneic transplant after auto-SCT. Using this feature set, we trained a variety of industry-standard ML models, including logistic regression, support vector machines, gradient boosted trees, and random forest. We used 5-fold cross validation to evaluate model performance, evaluated based on the area under the receiver operating curve (AUC). We then determined the relative importance of each feature in the logistic regression model and reviewed each feature's clinical relevance. Results: 1016 patients (588 patients who died within 5 years from their transplant date, and 428 patients who died 5 years or later following transplant) were included in the cohort. The logistic regression model was the best performing model, achieving an AUC of 0.77, accuracy of 0.70, and F1 score of 0.60 (Table). Eight of the 10 most predictive features for early mortality were presence of chromosome 1 abnormalities, higher age at diagnosis, higher serum albumin levels, higher number of visits before transplant, presence of ICD codes for comorbid conditions, and presence of administrations of pomalidomide, bortezomib, or zoledronic acid. These features were determined to be clinically meaningful and all were associated with mortality before 5 years. Receipt of fosaprepitant was also predictive of mortality before 5 years, though the clinical relationship is more challenging to explain. Presence of M-spike elevation in the 100 days before or after auto-SCT was predictive of reduced mortality. The latter finding, along with higher mortality in patients with higher serum albumin levels, was counterintuitive. Discussion: Auto-SCT procedures can have high toxicity and cost; therefore, accurate prediction of outcomes could improve understanding of the utility of auto-SCT for individual MM patients. Our study demonstrates the potential of using ML models for risk prediction in MM, though the presence of counterintuitive findings (e.g. higher albumin correlated to poorer survival) will require additional investigation. We hope that this study inspires future research into using ML techniques to support personalized clinical decision-making, help organize supportive care initiatives, and inform pre-approval decisions made by payers. Disclosures Chen: Flatiron Health, Inc: Employment; Roche: Equity Ownership. Garapati:Flatiron Health, Inc.: Employment. Wu:Flatiron Health: Employment. Ko:Flatiron Health, Inc.: Employment. Falk:Flatiron Health, Inc.: Employment; Roche: Equity Ownership. Dierov:Flatiron Health, Inc.: Employment; Roche: Equity Ownership. Stasiw:Roche: Equity Ownership; Flatiron Health, Inc.: Employment. Opong:Flatiron Health, Inc., which is an independent subsidiary of the Roche Group: Employment, Research Funding. Carson:Flatiron Health, Inc., which is an independent subsidiary of the Roche Group: Employment, Research Funding; Roche: Equity Ownership.

Publisher

American Society of Hematology

Subject

Cell Biology,Hematology,Immunology,Biochemistry

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3