PCAN: Probabilistic correlation analysis of two non‐normal data sets
Roger S. Zoh,
Bani Mallick,
Ivan Ivanov,
Veera Baladandayuthapani,
Ganiraju Manyam,
Robert S. Chapkin,
Johanna W. Lampe and
Raymond J. Carroll
Biometrics, 2016, vol. 72, issue 4, 1358-1368
Abstract:
Most cancer research now involves one or more assays profiling various biological molecules, e.g., messenger RNA and micro RNA, in samples collected on the same individuals. The main interest with these genomic data sets lies in the identification of a subset of features that are active in explaining the dependence between platforms. To quantify the strength of the dependency between two variables, correlation is often preferred. However, expression data obtained from next‐generation sequencing platforms are integer with very low counts for some important features. In this case, the sample Pearson correlation is not a valid estimate of the true correlation matrix, because the sample correlation estimate between two features/variables with low counts will often be close to zero, even when the natural parameters of the Poisson distribution are, in actuality, highly correlated. We propose a model‐based approach to correlation estimation between two non‐normal data sets, via a method we call Probabilistic Correlations ANalysis, or PCAN. PCAN takes into consideration the distributional assumption about both data sets and suggests that correlations estimated at the model natural parameter level are more appropriate than correlations estimated directly on the observed data. We demonstrate through a simulation study that PCAN outperforms other standard approaches in estimating the true correlation between the natural parameters. We then apply PCAN to the joint analysis of a microRNA (miRNA) and a messenger RNA (mRNA) expression data set from a squamous cell lung cancer study, finding a large number of negative correlation pairs when compared to the standard approaches.
Date: 2016
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1111/biom.12516
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:biomet:v:72:y:2016:i:4:p:1358-1368
Ordering information: This journal article can be ordered from
http://www.blackwell ... bs.asp?ref=0006-341X
Access Statistics for this article
More articles in Biometrics from The International Biometric Society
Bibliographic data for series maintained by Wiley Content Delivery ().