EconPapers    
Economics at your fingertips  
 

Statistical Wasserstein distance with rank regularization and spiked structure

Judy Yangjun Lin

Communications in Statistics - Theory and Methods, 2025, vol. 54, issue 19, 6303-6324

Abstract: In recent years, the optimal transport (OT) model has been successfully applied to machine learning, as the Wasserstein distance in the OT model can compare the distance between probability distributions and reflects the underlying geometry. However, the computation of the OT problem suffers from the sampling noise and curse of dimensionality. The rank regularization OT model has been shown that it can reduce the sampling noise in the computation, but it does not deal with the impact of the curse of dimensionality. In this article, using the idea of principal component analysis (referred to as spiked model), we improve the rank regularization OT model to reduce the impact of data dimension and propose the rank regularization spiked model. In this model, the coupling measure is decomposed by several product measures and each marginal measure of the product measures has low-dimensional structure. The factored coupling ensures the robustness of the Wasserstein distance and can reduce the sampling noise, at the same time the marginal measures are projected to the low-dimensional subspace to resolve the high-dimensional problem. For this model, we study the estimation error of the Wasserstein distance and give two statistical convergence results.

Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2025.2455943 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:54:y:2025:i:19:p:6303-6324

Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20

DOI: 10.1080/03610926.2025.2455943

Access Statistics for this article

Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe

More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().

 
Page updated 2025-09-05
Handle: RePEc:taf:lstaxx:v:54:y:2025:i:19:p:6303-6324