EconPapers    
Economics at your fingertips  
 

Testing for Associations of Opposite Directionality in a Heterogeneous Population

Fangyuan Zhang, Jie Ding and Shili Lin ()
Additional contact information
Fangyuan Zhang: Texas Tech University
Jie Ding: Stanford University
Shili Lin: The Ohio State University

Statistics in Biosciences, 2017, vol. 9, issue 1, No 8, 137-159

Abstract: Abstract In gene networks, it is possible that the patterns of gene co-expression may exist only in a subset of the sample. In studies of relationships between genotypes and expressions of genes over multiple tissues, there may be associations in some tissues but not in the others. Despite the importance of the problem in genomic applications, it is challenging to identify relationships between two variables when the correlation may only exist in a subset of the sample. The situation becomes even less tractable when there exist two subsets in which correlations are in opposite directions. By ranking subset relationships according to Kendall’s tau, a tau-path can be derived to facilitate the identification of correlated subsets, if such subsets exist. However, the current tau-path methodology only considers the situation in which there is association in a subsample; the more complex scenario depicting the existence of two subsets with opposite directionality of associations was not addressed. Further, existing algorithms for finding tau-paths may be suboptimal given their greedy nature. In this paper, we extend the tau-path methodology to accommodate the situation in which the sample may be drawn from a heterogeneous population composed of subpopulations portraying positive and negative associations. We also propose the use of a cross entropy Monte Carlo procedure to obtain an optimal tau-path, CEMC $$_{tp}$$ t p . The algorithm not only can provide simultaneous detection of positive and negative correlations in the same sample, but also can lead to the identification of subsamples that provide evidence for the detected associations. An extensive simulation study shows the aptness of CEMC $$_{tp}$$ t p for detecting associations under various scenarios. Compared with two standard tests for detecting associations, CEMC $$_{tp}$$ t p is seen to be more powerful when there are indeed complex subset associations with well-controlled type-I error rates. We applied CEMC $$_{tp}$$ t p to the NCI-60 gene expression data to illustrate its utility for uncovering network relationships that were missed with standard methods.

Keywords: Cross entropy Monte Carlo (CEMC); Tau-path; Heterogeneous sample; Subset associations; Gene networks (search for similar items in EconPapers)
Date: 2017
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s12561-016-9167-7 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:stabio:v:9:y:2017:i:1:d:10.1007_s12561-016-9167-7

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/12561

DOI: 10.1007/s12561-016-9167-7

Access Statistics for this article

Statistics in Biosciences is currently edited by Hongyu Zhao and Xihong Lin

More articles in Statistics in Biosciences from Springer, International Chinese Statistical Association
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:stabio:v:9:y:2017:i:1:d:10.1007_s12561-016-9167-7