Prioritizing Complex Disease Genes from Heterogeneous Public Databases

Author:

Gong Eric,Chen Jake Y.ORCID

Abstract

AbstractBackgroundComplex human diseases are defined not only by sophisticated patterns of genetic variants/mutations upstream but also by many interplaying genes, RNAs, and proteins downstream. Analyzing multiple genomic and functional genomic data types to determine a short list of genes or molecules of interest is a common task called “gene prioritization” in biology. There are many statistical, biological, and bioinformatic methods developed to perform gene prioritization tasks. However, little research has been conducted to examine the relationships among the technique used, merged/separate use of each data modality, the gene list’s network/pathway context, and various gene ranking/expansions.MethodsWe introduce a new analytical framework called “Gene Ranking and Iterative Prioritization based on Pathways” (GRIPP) to prioritize genes derived from different modalities. Multiple data sources, such as CBioPortal, PAGER, and COSMIC were used to compile the initial gene list. We used the PAGER software to expand the gene list based on biological pathways and the BEERE software to construct protein-protein interaction networks that include the gene list to rank order genes. We produced a final gene list for each data modality iteratively from an initial draft gene list, using glioblastoma multiform (GBM) as a case study.ConclusionWe demonstrated that GBM gene lists obtained from three modalities (differential gene expressions, gene mutations, and copy number alterations) and several data sources could be iteratively expanded and ranked using GRIPP. While integrating various modalities of data can be useful to generate an integrated ranked gene list related to any specific disease, the integration may also decrease the overall significance of ranked genes derived from specific data modalities. Therefore, we recommend carefully sorting and integrating gene lists according to each modality, such as gene mutations, epigenetic controls, or differential expressions, to procure modality-specific biological insights into the prioritized genes.

Publisher

Cold Spring Harbor Laboratory

Reference24 articles.

1. Text mining in cancer gene and pathway prioritization;Cancer Inform,2014

2. Disease gene-fishing in molecular interaction networks: a case study in colorectal cancer;Annu Int Conf IEEE Eng Med Biol Soc,2009

3. The Molecular Signatures Database (MSigDB) hallmark gene set collection;Cell Syst,2015

4. RNA editing of nuclear transcripts in Arabidopsis thaliana

5. Redefining breast cancer subtypes to guide treatment prioritization and maximize response: Predictive biomarkers across 10 cancer therapies;Cancer Cell,2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3