EconPapers    
Economics at your fingertips  
 

Fast regularized canonical correlation analysis

Raul Cruz-Cano and Mei-Ling Ting Lee

Computational Statistics & Data Analysis, 2014, vol. 70, issue C, 88-100

Abstract: Canonical correlation analysis is a popular statistical method for the study of the correlations between two sets of variables. Finding the canonical correlations between these datasets requires the inversion of their corresponding sample correlation matrices. When the number of variables is large compared to the number of experimental units it is impossible to calculate the inverse of these matrices directly and therefore it is necessary to add a multiple of the identity matrix to them. This procedure is known as regularization. In this paper we present an alternative method to the existing regularization algorithm. The proposed method is based on the estimates of the correlation matrices which minimize the mean squared error risk function. The solution of this optimization problem can be found analytically and consists of a small set of computationally inexpensive equations. We also present material which shows that the proposed method is more stable and provides more accurate results than the standard regularized canonical correlation method. Finally, the application of our original method to NCI-60 microRNA cancer data proves that it can deliver useful insights in study cases which involve hundreds of variables.

Keywords: Canonical correlation analysis; Regularization; Regularized canonical correlation; Shrinkage of covariance matrix; NCI-60 microRNA expression data (search for similar items in EconPapers)
Date: 2014
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (3)

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0167947313003447
Full text for ScienceDirect subscribers only.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:csdana:v:70:y:2014:i:c:p:88-100

DOI: 10.1016/j.csda.2013.09.020

Access Statistics for this article

Computational Statistics & Data Analysis is currently edited by S.P. Azen

More articles in Computational Statistics & Data Analysis from Elsevier
Bibliographic data for series maintained by Catherine Liu ().

 
Page updated 2025-03-19
Handle: RePEc:eee:csdana:v:70:y:2014:i:c:p:88-100