Weighted Kolmogorov Smirnov testing: an alternative for Gene Set Enrichment Analysis
Charmpi Konstantina and Ycart Bernard
Additional contact information
Charmpi Konstantina: Université Grenoble Alpes, France; Laboratoire Jean Kuntzmann, CNRS UMR5224, Grenoble, France; Laboratoire d’Excellence TOUCAN, Toulouse, France
Ycart Bernard: Université Grenoble Alpes, France; Laboratoire Jean Kuntzmann, CNRS UMR5224, Grenoble, France; Laboratoire d’Excellence TOUCAN, Toulouse, France
Statistical Applications in Genetics and Molecular Biology, 2015, vol. 14, issue 3, 279-293
Abstract:
Gene Set Enrichment Analysis (GSEA) is a basic tool for genomic data treatment. Its test statistic is based on a cumulated weight function, and its distribution under the null hypothesis is evaluated by Monte-Carlo simulation. Here, it is proposed to subtract its asymptotic expectation from the cumulated weight function, then scale the result. Under the null hypothesis, the convergence in distribution of the new test statistic is proved, using the theory of empirical processes. The limiting distribution needs to be computed only once, and can then be used for many different gene sets. This results in large savings in computing time. The test defined in this way is called the Weighted Kolmogorov Smirnov (WKS) test. Using expression data from the GEO repository, tested against the MSig Database C2, the classical GSEA test and the new procedure have been compared. Our conclusion is that, beyond its mathematical and algorithmic advantages, the WKS test could in many cases be more informative than the classical GSEA test.
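The centring and scaling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It rests on assumptions that do not come from the article: genes are indexed by rank (index 0 is the top-ranked gene), the expectation under the null is the finite-sample one for a uniformly drawn gene set rather than the asymptotic expectation, the scaling factor is the square root of the gene set size, and the reference distribution is approximated by simulation rather than derived from the limiting process. The function names `wks_statistic`, `simulate_null` and `p_value` are hypothetical.

```python
import numpy as np


def wks_statistic(weights, gene_set):
    """Centred, scaled cumulated-weight statistic (illustrative normalisation only)."""
    weights = np.asarray(weights, dtype=float)
    n = weights.size
    in_set = np.zeros(n, dtype=bool)
    in_set[np.asarray(gene_set)] = True
    k = int(in_set.sum())
    # Cumulated weight of the gene-set members, walking down the ranked list.
    observed = np.cumsum(weights * in_set) / k
    # Expected cumulated weight when the k genes are drawn uniformly at random.
    expected = np.cumsum(weights) / n
    # Subtract the expectation, then scale by sqrt(k) (assumed scaling).
    return np.sqrt(k) * np.max(np.abs(observed - expected))


def simulate_null(weights, set_size, n_draws=2000, seed=0):
    """Simulate the reference distribution once; the sorted sample can be reused."""
    rng = np.random.default_rng(seed)
    n = len(weights)
    draws = [
        wks_statistic(weights, rng.choice(n, size=set_size, replace=False))
        for _ in range(n_draws)
    ]
    return np.sort(draws)


def p_value(stat, null_sample):
    """Upper-tail Monte-Carlo p-value with the usual +1 correction."""
    exceed = null_sample.size - np.searchsorted(null_sample, stat, side="left")
    return (exceed + 1) / (null_sample.size + 1)
```

In this sketch, the sorted sample returned by `simulate_null` is computed once and then passed to `p_value` for each gene set of interest, which mirrors the reuse of a single reference distribution described in the abstract. Reusing one sample across gene sets of different sizes is only a rough approximation here, reasonable when the sets are small relative to the number of genes.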
Keywords: empirical processes; GSEA; Monte-Carlo simulation; statistical test; weak convergence
Date: 2015
Downloads:
https://doi.org/10.1515/sagmb-2014-0077 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:14:y:2015:i:3:p:279-293:n:5
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.1515/sagmb-2014-0077
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
Bibliographic data for series maintained by Peter Golla.