Informative or Noninformative Calls for Gene Expression: A Latent Variable Approach

Adetayo, Kasim; Dan, Lin; Van Sanden, Suzy; Djork-Arné, Clevert; Luc, Bijnens; Hinrich, Göhlmann; Dhammika, Amaratunga; Sepp, Hochreiter; Ziv, Shkedy; Willem, Talloen

Kasim Adetayo, Lin Dan, Suzy Van Sanden, Clevert Djork-Arné, Bijnens Luc, Göhlmann Hinrich, Amaratunga Dhammika, Hochreiter Sepp, Shkedy Ziv and Talloen Willem
Additional contact information
Kasim Adetayo: Universiteit Hasselt & Katholieke Universiteit Leuven
Lin Dan: Universiteit Hasselt & Katholieke Universiteit Leuven
Suzy Van Sanden: Universiteit Hasselt & Katholieke Universiteit Leuven
Clevert Djork-Arné: Johannes Kepler University Linz & Charité - Universitätsmedizin Berlin
Bijnens Luc: Janssen Pharmaceutica N. V., Beerse
Göhlmann Hinrich: Janssen Pharmaceutica N. V., Beerse
Amaratunga Dhammika: Johnson & Johnson Pharmaceutical Research & Development, Raritan
Hochreiter Sepp: Johannes Kepler University Linz
Shkedy Ziv: Universiteit Hasselt & Katholieke Universiteit Leuven
Talloen Willem: Janssen Pharmaceutica N. V., Beerse

Statistical Applications in Genetics and Molecular Biology, 2010, vol. 9, issue 1, 31

Abstract: Both the strength and the weakness of microarray technology can be attributed to the enormous amount of information it generates. To fully realize the benefit of microarray technology for testing differentially expressed genes and for classification, there is a need to minimize the number of irrelevant genes present in microarray data. A major interest is to use probe-level data to call genes informative or noninformative based on the trade-off between the array-to-array variability and the measurement error. Existing work in this direction includes filtering likely uninformative sets of hybridization (FLUSH; Calza et al., 2007) and I/NI calls for the exclusion of noninformative genes using FARMS (I/NI calls; Talloen et al., 2007; Hochreiter et al., 2006). In this paper, we propose a linear mixed model as a more flexible method that performs as well as I/NI calls and outperforms FLUSH. We also introduce other criteria for gene filtering, such as R² and intra-cluster correlation. Additionally, we include some objective criteria based on likelihood ratio testing, the Akaike information criterion (AIC; Akaike, 1973) and the Bayesian information criterion (BIC; Schwarz, 1978). Based on the HGU-133A Spiked-in data set, it is shown that the linear mixed model approach outperforms FLUSH, a method that filters genes based on a quantile regression. The linear mixed model is equivalent to a factor analysis model when either the factor loadings are set to a constant with the variance of the latent factor equal to one, or the factor loadings are set to one with the variance of the latent factor left unconstrained. Filtering based on conditional variance calls a probe set informative when the intensity of one or more probes is consistent across the arrays, while filtering using R² or intra-cluster correlation calls a probe set informative only when the average intensity of the probe set is consistent across the arrays. Filtering based on the likelihood ratio test, AIC and BIC is less stringent than filtering based on the other criteria.
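
To make the intra-cluster correlation criterion concrete, the following is a minimal sketch and not the authors' implementation. It assumes one plausible form of the model, y[i, j] = mu[i] + b[j] + e[i, j], where i indexes probes in a probe set, j indexes arrays, b[j] is a random array effect with variance tau2 and e[i, j] is measurement error with variance sigma2. The function and variable names are hypothetical, and the variance components are estimated by a simple method-of-moments (ANOVA) calculation rather than by fitting a mixed model with likelihood-based software.

```python
import numpy as np


def intra_cluster_correlation(y):
    """Moment-based intra-cluster correlation for one probe set.

    y: 2-D array, rows = probes, columns = arrays (log-scale intensities).
    Assumed illustrative model: y[i, j] = mu[i] + b[j] + e[i, j],
    with b[j] ~ N(0, tau2) shared by all probes on array j
    and e[i, j] ~ N(0, sigma2) as measurement error.
    """
    y = np.asarray(y, dtype=float)
    n_probes, n_arrays = y.shape
    if n_probes < 2 or n_arrays < 2:
        raise ValueError("need at least 2 probes and 2 arrays")

    grand = y.mean()
    probe_means = y.mean(axis=1, keepdims=True)
    array_means = y.mean(axis=0, keepdims=True)

    # Mean square between arrays; its expectation is sigma2 + n_probes * tau2.
    ms_arrays = n_probes * np.sum((array_means - grand) ** 2) / (n_arrays - 1)
    # Residual mean square after removing probe and array effects;
    # its expectation is sigma2.
    resid = y - probe_means - array_means + grand
    ms_error = np.sum(resid ** 2) / ((n_probes - 1) * (n_arrays - 1))

    # Truncate the array-variance estimate at zero.
    tau2 = max(0.0, (ms_arrays - ms_error) / n_probes)
    sigma2 = ms_error
    total = tau2 + sigma2
    return 0.0 if total == 0.0 else tau2 / total


def simulate_probe_set(rng, n_probes=11, n_arrays=20, tau=1.0, sigma=0.3):
    """Draw synthetic intensities from the assumed model (illustration only)."""
    mu = rng.normal(8.0, 1.0, size=(n_probes, 1))          # probe effects
    b = rng.normal(0.0, tau, size=(1, n_arrays))           # array effects
    e = rng.normal(0.0, sigma, size=(n_probes, n_arrays))  # measurement error
    return mu + b + e


rng = np.random.default_rng(0)
icc_signal = intra_cluster_correlation(simulate_probe_set(rng, tau=1.0))
icc_noise = intra_cluster_correlation(simulate_probe_set(rng, tau=0.0))
print(f"ICC with array-to-array signal: {icc_signal:.2f}")
print(f"ICC with measurement error only: {icc_noise:.2f}")
```

Under these assumptions, a probe set whose probes move together across arrays gives a ratio near one, while a probe set dominated by measurement error gives a ratio near zero. The cutoff for an informative call is not specified here and would have to be chosen by the analyst.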

Keywords: gene filtering; factor analysis; linear mixed model
Date: 2010
Downloads: (external link)
https://doi.org/10.2202/1544-6115.1460 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:9:y:2010:i:1:n:4

Ordering information: This journal article can be ordered from
https://www.degruyte ... urnal/key/sagmb/html

DOI: 10.2202/1544-6115.1460

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter