Sensitivity of Survival Analysis Metrics

Author:

Vasilev Iulii1ORCID,Petrovskiy Mikhail1ORCID,Mashechkin Igor1ORCID

Affiliation:

1. Computer Science Department, Lomonosov Moscow State University, Vorobjovy Gory, 119899 Moscow, Russia

Abstract

Survival analysis models allow for predicting the probability of an event over time. The specificity of the survival analysis data includes the distribution of events over time and the proportion of classes. Late events are often rare and do not correspond to the main distribution and strongly affect the quality of the models and quality assessment. In this paper, we identify four cases of excessive sensitivity of survival analysis metrics and propose methods to overcome them. To set the equality of observation impacts, we adjust the weights of events based on target time and censoring indicator. According to the sensitivity of metrics, AUPRC (area under Precision-Recall curve) is best suited for assessing the quality of survival models, and other metrics are used as loss functions. To evaluate the influence of the loss function, the Bagging model uses ones to select the size and hyperparameters of the ensemble. The experimental study included eight real medical datasets. The proposed modifications of IBS (Integrated Brier Score) improved the quality of Bagging compared to the classical loss functions. In addition, in seven out of eight datasets, the Bagging with new loss functions outperforms the existing models of the scikit-survival library.

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Reference45 articles.

1. Kleinbaum, D., and Klein, M. (2016). Survival Analysis: A Self-Learning Text, Springer. [3rd ed.]. Statistics for Biology and Health.

2. Machine learning for survival analysis: A survey;Wang;ACM Comput. Surv. (CSUR),2019

3. Weighted Log-Rank Statistics for Accelerated Failure Time Model;Lee;Stats,2021

4. Examining tests for comparing survival curves with right censored data;Karadeniz;Stat Transit,2017

5. On the versatility of the combination of the weighted log-rank statistics;Lee;Comput. Stat. Data Anal.,2007

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3