Ensemble sparse estimation of covariance structure for exploring genetic disease data
Xiaoning Kang and
Mingqiu Wang
Computational Statistics & Data Analysis, 2021, vol. 159, issue C
Abstract:
High-dimensional data often occur nowadays in various areas, such as genetic and microarray data. The covariance matrix is of fundamental importance in analyzing the relationship between multivariate variables. A powerful tool for estimating a covariance matrix is the modified Cholesky decomposition, which allows for unconstrained estimation and guarantees the positive definiteness of the estimate. However, it requires a pre-specified ordering of variables before analysis, which is often not available in the real data. Hence, an ensemble Cholesky-based sparse estimation is proposed for a high-dimensional covariance matrix by adopting the model averaging idea. The asymptotically theoretical convergence rate is established under some regularity conditions. The merits of the proposed model are illustrated by the numerical study and two genetic disease data.
Keywords: High-dimensional; Soft thresholding; Sparsity; Variable ordering (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0167947321000542
Full text for ScienceDirect subscribers only.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:csdana:v:159:y:2021:i:c:s0167947321000542
DOI: 10.1016/j.csda.2021.107220
Access Statistics for this article
Computational Statistics & Data Analysis is currently edited by S.P. Azen
More articles in Computational Statistics & Data Analysis from Elsevier
Bibliographic data for series maintained by Catherine Liu ().