Author:
Charmpi Konstantina,Ycart Bernard
Abstract
AbstractGene Set Enrichment Analysis (GSEA) is a basic tool for genomic data treatment. Its test statistic is based on a cumulated weight function, and its distribution under the null hypothesis is evaluated by Monte-Carlo simulation. Here, it is proposed to subtract to the cumulated weight function its asymptotic expectation, then scale it. Under the null hypothesis, the convergence in distribution of the new test statistic is proved, using the theory of empirical processes. The limiting distribution needs to be computed only once, and can then be used for many different gene sets. This results in large savings in computing time. The test defined in this way has been called Weighted Kolmogorov Smirnov (WKS) test. Using expression data from the GEO repository, tested against the MSig Database C2, a comparison between the classical GSEA test and the new procedure has been conducted. Our conclusion is that, beyond its mathematical and algorithmic advantages, the WKS test could be more informative in many cases, than the classical GSEA test.
Subject
Computational Mathematics,Genetics,Molecular Biology,Statistics and Probability
Reference60 articles.
1. Core Team for Statistical Foundation for Statistical www project org ISBN;Vienna;Language Environment Computing Computing Austria,2013
2. expression omnibus gene expression and hybridization array data repository;Edgar;Gene Nucleic Acids Res,2002
3. parametric analysis of gene set enrichment;Kim;BMC Bioinformatics,2005
4. Robust in New York;Héritier;methods biostatistics,2009
5. Nonparametric goodness - of - fit tests for discrete null distributions;Arnold;Journal,2011
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献