Information‐incorporated sparse hierarchical cancer heterogeneity analysis

Authors:

Han Wei (1, 2), Zhang Sanguo (1, 2), Ma Shuangge (3), Ren Mingyang (4)

Affiliations:

1. School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China

2. Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Sciences, Beijing, China

3. Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut

4. School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, China

Abstract

Cancer heterogeneity analysis is essential for precision medicine. Most of the existing heterogeneity analyses only consider a single type of data and ignore the possible sparsity of important features. In cancer clinical practice, it has been suggested that two types of data, pathological imaging and omics data, are commonly collected and can produce hierarchical heterogeneous structures, in which the refined sub‐subgroup structure determined by omics features can be nested in the rough subgroup structure determined by the imaging features. Moreover, sparsity pursuit has extraordinary significance and is more challenging for heterogeneity analysis, because the important features may not be the same in different subgroups, which is ignored by the existing heterogeneity analyses. Fortunately, rich information from previous literature (for example, those deposited in PubMed) can be used to assist feature selection in the present study. Advancing from the existing analyses, in this study, we propose a novel sparse hierarchical heterogeneity analysis framework, which can integrate two types of features and incorporate prior knowledge to improve feature selection. The proposed approach has satisfactory statistical properties and competitive numerical performance. A TCGA real data analysis demonstrates the practical value of our approach in analyzing data heterogeneity and sparsity.

Funders

National Science Foundation of Sri Lanka

National Institutes of Health

National Natural Science Foundation of China

Publisher

Wiley

