Concise functional enrichment of ranked gene lists

Author:

Jia Xinglin,Phan An,Kadelka ClausORCID

Abstract

AbstractGenome-wide expression data has become ubiquitous within the last two decades. Given such data, functional enrichment methods identify functional categories (e.g., biological processes) that preferentially annotate differentially expressed genes. However, many existing methods operate in a binary manner, disregarding valuable information contained in the gene ranking. The few methods that consider the ranking often return redundant or non-specific functional categories.To address these limitations, we developed a novel method called Concise Ranked Functional Enrichment (CRFE), which effectively leverages the ranking information in gene expression data to compute a non-redundant set of specific functional categories that are notably enriched for highly ranked genes. A particularly useful feature of CRFE is a tunable parameter that defines how much focus should be given to the most highly ranked genes. Using four treatment-control RNA-seq datasets, we compared the performance of CRFE with the two most widely used types of functional enrichment methods, Gene Set Enrichment Analysis and over-representation analysis. We evaluated the methods based on their ability to utilize ranking information, generate non-redundant results, and return functional categories with high information content. CRFE excelled in all evaluated criteria, outperforming the existing methods, each of which exhibits deficiencies in at least one aspect. Using lung adenocarcinoma data, we further showed that the functional categories identified by CRFE are biologically meaningful.In conclusion, CRFE computes an informative set of functional categories that summarizes genome-wide expression data. With its superior performance over existing methods, CRFE harbors great promise to become a widely used functional enrichment method.Author summaryGiven a list of differentially expressed genes as input, functional enrichment methods reveal which functional categories (e.g., biological processes) were likely activated by the cell and are responsible for the differential expression. We developed a new such method, called Concise Ranked Functional Enrichment (CRFE), which addresses the limitations of current approaches by incorporating gene ranking information to compute a concise and specific set of enriched functional categories. Using four treatment-control RNA-seq datasets, we evaluate how well CRFE and the two currently most widely used methods perform in three criteria. We find that CRFE outperforms each of the alternative methods in at least one of the evaluated criteria, demonstrating its superiority. A high-level interpretation of the functional categories identified by CRFE for lung adenocarcinoma datasets highlights its usefulness for experimentalists. Overall, CRFE harnesses the power of ranked gene lists to generate a focused and non-redundant set of enriched functional categories. Our study positions CRFE as a promising method for functional enrichment analysis, with the potential to advance research in this field.

Publisher

Cold Spring Harbor Laboratory

Reference49 articles.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3