Treating Expression Levels of Different Genes as a Sample in Microarray Data Analysis: Is it Worth a Risk?

Klebanov Lev and Yakovlev Andrei
Additional contact information
Klebanov Lev: Department of Probability and Statistics, Charles University
Yakovlev Andrei: University of Rochester, Rochester, NY

Statistical Applications in Genetics and Molecular Biology, 2006, vol. 5, issue 1, 1-11

Abstract: One of the prevailing ideas in the literature on microarray data analysis is to pool the expression measures across genes and treat them as a sample drawn from some distribution. Several universal laws were proposed to analytically describe this distribution. This idea raises a number of concerns. The expression levels of genes are not identically distributed random variables so that treating them as a sample amounts to sampling from a mixture of equally weighted distributions, each being associated with a different gene. The expression levels of different genes are heavily dependent random variables so that the law of large numbers and statistical goodness-of-fit tests are normally inapplicable to this kind of data. This dependence represents a very serious pitfall in microarray data analysis.
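The law-of-large-numbers concern in the abstract can be illustrated with a small simulation. This is only a sketch under assumed conditions and does not reproduce the authors' analysis: the Gaussian model, the per-array shared term used to induce dependence, and the names `pooled_means` and `spread_of_pooled_mean` are all hypothetical choices made for illustration. It compares how much the across-gene average varies from array to array when genes are independent and when they share a common random component.

```python
import random
import statistics


def pooled_means(n_genes, n_arrays, shared_sd, seed=0):
    """Return the across-gene mean expression of each simulated array."""
    rng = random.Random(seed)
    # Assumed gene-specific means: the genes are not identically distributed.
    gene_means = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
    means = []
    for _ in range(n_arrays):
        # Assumed dependence model: one shared term per array, added to
        # every gene, so that all pairs of genes are correlated.
        shared = rng.gauss(0.0, shared_sd)
        levels = [mu + shared + rng.gauss(0.0, 1.0) for mu in gene_means]
        means.append(statistics.fmean(levels))
    return means


def spread_of_pooled_mean(n_genes, shared_sd, n_arrays=200):
    """Standard deviation of the pooled mean across simulated arrays."""
    return statistics.stdev(pooled_means(n_genes, n_arrays, shared_sd))


for n_genes in (100, 1000):
    indep = spread_of_pooled_mean(n_genes, shared_sd=0.0)
    dep = spread_of_pooled_mean(n_genes, shared_sd=1.0)
    print(f"{n_genes:5d} genes: independent {indep:.3f}, dependent {dep:.3f}")
```

In this toy model the spread of the pooled mean shrinks as more independent genes are added, but stays close to the standard deviation of the shared term when genes are dependent, so pooling more genes does not behave like collecting a larger sample.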

Date: 2006

For access to full text, subscription to the journal or payment for the individual article is required.

DOI: 10.2202/1544-6115.1185

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla.

Page updated 2021-05-07
Handle: RePEc:bpj:sagmbi:v:5:y:2006:i:1:n:9