Abstract
We propose a method to predict when a woman will develop breast cancer (BCa) from her lifestyle and health history features. To address this objective, we use data from the Alberta’s Tomorrow Project of 18,288 women to train Individual Survival Distribution (ISD) models to predict an individual’s Breast-Cancer-Onset (BCaO) probability curve. We show that our three-step approach–(1) filling missing data with multiple imputations by chained equations, followed by (2) feature selection with the multivariate Cox method, and finally, (3) using MTLR to learn an ISD model–produced the model with the smallest L1-Hinge loss among all calibrated models with comparable C-index. We also identified 7 actionable lifestyle features that a woman can modify and illustrate how this model can predict the quantitative effects of those changes–suggesting how much each will potentially extend her BCa-free time. We anticipate this approach could be used to identify appropriate interventions for individuals with a higher likelihood of developing BCa in their lifetime.
Funder
Alberta Health, Alberta, Canada
Canadian Breast Cancer Foundation, Prairies/NWT Chapter, Canada
Alberta Cancer Foundation, Alberta, Canada
Canadian Partnership Against Cancer and Health Canada, Ontario, Canada
Alberta Health Services, Alberta, Canada
Alberta Machine Intelligence Institute
Natural Sciences and Engineering Research Council of Canada
Publisher
Public Library of Science (PLoS)
Reference75 articles.
1. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries;H Sung;CA Cancer J Clin,2021
2. Cancer statistics for the year 2020: An overview;J Ferlay;Int J Cancer,2021
3. Cancer Statistics, 2021;RL Siegel;CA Cancer J Clin,2021
4. Cancer statistics, 2020;RL Siegel;CA Cancer J Clin,2020
5. Breast cancer development and progression: Risk factors, cancer stem cells, signaling pathways, genomics, and molecular pathogenesis;Y Feng;Genes Dis,2018
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献