Multi-Stage Methods for Cost Controlled Data Compression Using Principal Component Analysis
Swarnali Banerjee () and
Bhargab Chattopadhyay
Additional contact information
Swarnali Banerjee: Department of Mathematics and Statistics and Center for Data Science and Consulting, Loyola University, Chicago, IL 60660, USA
Bhargab Chattopadhyay: School of Management & Entrepreneurship, Indian Institute of Technology Jodhpur, Jodhpur 342030, Rajasthan, India
Mathematics, 2025, vol. 13, issue 13, 1-15
Abstract:
Several online principal component analysis (PCA) methodologies exist for data arriving sequentially that focus only on compression risk minimization. Recent work in this realm revolves around minimizing the cost-compression risk, which takes into account compression loss and sampling costs using a two-stage PCA procedure. Even though the procedure enjoys first-order efficiency, the authors could not mathematically verify the existence of the second-order efficiency property. In this article, we minimize cost-compression risk using a modified two-stage PCA procedure, which takes into account the data compression loss as well as the sampling cost when the smallest eigenvalue of the population variance–covariance matrix or its positive lower bound is known when the data is assumed to follow a multivariate normal distribution. The modified two-stage PCA procedure is shown to possess the second-order efficiency property, among others, including the second-order risk efficiency property under some conditions. The proposed method is novel but also fast and efficient, as illustrated by extensive data analyses through simulations and real data analysis.
Keywords: cost-compression risk; efficiency; risk; two-stage methodology (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/13/13/2140/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/13/2140/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:13:p:2140-:d:1691185
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().