Statistical Wasserstein distance with rank regularization and spiked structure
Judy Yangjun Lin
Communications in Statistics - Theory and Methods, 2025, vol. 54, issue 19, 6303-6324
Abstract:
In recent years, the optimal transport (OT) model has been successfully applied to machine learning, as the Wasserstein distance in the OT model can compare the distance between probability distributions and reflects the underlying geometry. However, the computation of the OT problem suffers from the sampling noise and curse of dimensionality. The rank regularization OT model has been shown that it can reduce the sampling noise in the computation, but it does not deal with the impact of the curse of dimensionality. In this article, using the idea of principal component analysis (referred to as spiked model), we improve the rank regularization OT model to reduce the impact of data dimension and propose the rank regularization spiked model. In this model, the coupling measure is decomposed by several product measures and each marginal measure of the product measures has low-dimensional structure. The factored coupling ensures the robustness of the Wasserstein distance and can reduce the sampling noise, at the same time the marginal measures are projected to the low-dimensional subspace to resolve the high-dimensional problem. For this model, we study the estimation error of the Wasserstein distance and give two statistical convergence results.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2025.2455943 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:54:y:2025:i:19:p:6303-6324
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20
DOI: 10.1080/03610926.2025.2455943
Access Statistics for this article
Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe
More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().