Methodology of solving the feature selection problem for the Cox regression model

Author:

Mikulik Il'ya Igorevich1

Affiliation:

1. Emperor Alexander I St. Petersburg State Transport University

Abstract

The technique based on the use of a hybrid optimization method to solve the feature selection problem for the Cox regression model is proposed. The hybrid optimization method includes two metaheuristic methods: the ant colony optimization and the genetic algorithm. The ant colony optimization used as the basic algorithm that solves the main optimization problem. The genetic algorithm solves the problem of finding the optimal set of parameters for the ant algorithm improving its performance. The method is modified and adapted to solve the problem under consideration. The key feature of adaptation is the deposition of pheromones on the vertices rather than on the edges of the graph, as well as the method for calculating the assessment of heuristic information about each vertex. A fitness target function was constructed that determines the quality of solutions to the feature selection problem and is based on an assessment of the performance of the Cox model with a selected set of parameters. The concordance index (c-index) was used to evaluate the Cox model. The efficiency of the methodology is shown using the implemented program using the example of a database of recidivism. For the database used, the most significant sets of features were obtained that have the greatest impact on the quality of training of the survival analysis model.

Publisher

Astrakhan State Technical University

Reference26 articles.

1. Archetti A., Lomurno E., Lattari F., Martin A., Matteucci M. Heterogeneous Datasets for Federated Survival Analysis Simulation // Companion of the 2023 ACM/SPEC International Conference on Performance Engineering, 2023. DOI: 10.1145/3578245.3584935., Archetti A., Lomurno E., Lattari F., Martin A., Matteucci M. Heterogeneous Datasets for Federated Survival Analysis Simulation. Companion of the 2023 ACM/SPEC International Conference on Performance Engineering, 2023. DOI: 10.1145/3578245.3584935.

2. George B., Seals S., Aban I. Survival analysis and regression models // J. Nucl. Cardiol. 2014. P. 686–694. DOI: 10.1007/s12350-014-9908-2., George B., Seals S., Aban I. Survival analysis and regression models. J. Nucl. Cardiol., 2014, pp. 686-694. DOI: 10.1007/s12350-014-9908-2.

3. Atlam M., Torkey H., El-Fishawy N., Salem H. Coronavirus disease 2019 (COVID-19): survival analysis using deep learning and Cox regression model // Pattern Anal. Applic. 2021. N. 24 (3). P. 993–1005. DOI: 10.1007/s10044-021-00958-0., Atlam M., Torkey H., El-Fishawy N., Salem H. Coronavirus disease 2019 (COVID-19): survival analysis using deep learning and Cox regression model. Pattern Anal. Applic., 2021, no. 24 (3), pp. 993-1005. DOI: 10.1007/s10044-021-00958-0.

4. Chen X., Yuan G., Nie F., Ming Z. Semi-Supervised Feature Selection via Sparse Rescaled Linear Square Regression // IEEE Transaction on Knowledge Discovery and Data Engineering 32, 2018. P. 165–176. DOI: 10.1109/TKDE.2018.2879797., Chen X., Yuan G., Nie F., Ming Z. Semi-Supervised Feature Selection via Sparse Rescaled Linear Square Regression. IEEE Transaction on Knowledge Discovery and Data Engineering 32, 2018. P. 165–176. DOI: 10.1109/TKDE.2018.2879797.

5. Rossi P. H., Berk R. A., Lenihan K. J. Money, work and crime: some experimental results. N. Y.: Academic Press, 1980. 336 p., Rossi P. H., Berk R. A., Lenihan K. J. Money, work and crime: some experimental results. New York, Academic Press, 1980. 336 p.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3