MC‐KV: A Prognosis‐Oriented Classifier Based on Semi‐Supervised Learning for Molecular Subtyping of Colorectal Cancer

Author:

Bai Liyi1234,Bu Fanqin1234,Li Xiangji1234,Yang Xiaohan1234,Guo Shuilong1234,Min Li1234ORCID,Zhang Shutian1234

Affiliation:

1. Department of Gastroenterology Beijing Friendship Hospital Capital Medical University Beijing 100050 China

2. Beijing Key Laboratory for Precancerous Lesion of Digestive Diseases Beijing 100050 China

3. National Clinical Research Center for Digestive Diseases Beijing 100050 China

4. Beijing Digestive Disease Center Beijing Beijing 100050 China

Abstract

AbstractColorectal cancer (CRC) is the second leading cause of cancer‐related death worldwide. Many molecular classification strategies are proposed for CRC but few studies include survival data in their models. Herein a prognosis‐oriented CRC classifier is constructed by adapting the natural partially labeled censored survival data into a customized semi‐supervised learning algorithm, which is called Monte‐Carlo K‐nearest neighbor voting (MC‐KV) classifier. Three CRC subtypes with distinct prognoses are identified by this classifier using the data from the cancer genome atlas. Furthermore, a six‐gene risk model is constructed by combining weighted gene coexpression network analysis and least absolute selection and shrinkage operator for variable selection and four algorithms (random survival forest, support vector machine, Adaboost, and logistic regression) for optimization. The optimized model shows great performance in distinguishing high‐risk from low‐risk patients with a maximum area under curve of 0.869, 0.906, and 0.921 in 1‐, 3‐, and 5‐year survival, respectively. Additionally, the six‐gene signature identified by MC‐KV exhibits great predictive efficiency for other cancer types. Overall, a tool, Monte‐Carlo K‐nearest neighbor voting (MC‐KV), is provided to identify molecular subtyping of CRC, which suggests the potential contribution of semi‐supervised algorithms and the inclusion of patient‐level survival data in cancer classification.

Funder

National Natural Science Foundation of China

Publisher

Wiley

Subject

Multidisciplinary,Modeling and Simulation,Numerical Analysis,Statistics and Probability

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3