Optimal subset selection for distributed local principal component analysis
Guangbao Guo and
Guoqi Qian
Physica A: Statistical Mechanics and its Applications, 2025, vol. 658, issue C
Abstract:
Given that distributed PCA methods may sometimes produce large local approximation error, we propose a novel distributed PCA method, called distributed local PCA, to reduce the error by dimensionality reduction with an optimal subset selection criterion. The advantages of our optimal subset selection for the DLCPA include enhanced accuracy through precise covariance estimation, efficiency in handling large data sets, scalability in managing variable-exceeding-node scenarios, robustness against outliers, flexibility in parameter selection, and adaptability across data distributions. The involved low-dimensional covariance sub-estimators are obtained by computing their local principal components in distributed manner, and the one-step average covariance estimator is computed. Besides, mean squared error (MSE) is selected to measure the performance of the proposed method, and an optimal sub-estimator from the optimal criterion is obtained as having the minimum MSE value among all covariance sub-estimators. It is shown that the proposed method can not only improve the estimation accuracies in both simulated and real data experiments, but also greatly save computing time, which has a good promotion value in tackling big data problems.
Keywords: Data reduction; Distributed inference; Computational complexity; Local principal component analysis (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0378437124008185
Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:phsmap:v:658:y:2025:i:c:s0378437124008185
DOI: 10.1016/j.physa.2024.130308
Access Statistics for this article
Physica A: Statistical Mechanics and its Applications is currently edited by K. A. Dawson, J. O. Indekeu, H.E. Stanley and C. Tsallis
More articles in Physica A: Statistical Mechanics and its Applications from Elsevier
Bibliographic data for series maintained by Catherine Liu ().