Log-PCA versus Geodesic PCA of histograms in the Wasserstein space
Elsa Cazelles (),
Vivien Seguy (),
Jérémie Bigot (),
Marco Cuturi () and
Nicolas Papadakis ()
Additional contact information
Elsa Cazelles: Institut de Mathématiques de Bordeaux; CNRS; IMB-UMR5251; Université de Bordeaux
Vivien Seguy: Graduate School of Informatics; Kyoto University
Jérémie Bigot: Institut de Mathématiques de Bordeaux; CNRS; IMB-UMR5251; Université de Bordeaux
Marco Cuturi: CREST; ENSAE; Université Paris-Saclay
Nicolas Papadakis: Institut de Mathématiques de Bordeaux ; CNRS; IMB-UMR5251; Université de Bordeaux
No 2017-85, Working Papers from Center for Research in Economics and Statistics
Abstract:
This paper is concerned by the statistical analysis of data sets whose elements are random histograms. For the purpose of learning principal modes of variation from such data, we consider the issue of computing the PCA of histograms with respect to the 2-Wasserstein distance between probability measures. To this end, we propose to compare the methods of log-PCA and geodesic PCA in the Wasserstein space as introduced in [BGKL15, SC15]. Geodesic PCA involves solving a non-convex optimization problem. To solve it approximately, we propose a novel forward-backward algorithm. This allows a detailed comparison between log-PCA and geodesic PCA of one-dimensional histograms, which we carry out using various datasets, and stress the bene ts and drawbacks of each method. We extend these ;Classification-JEL: 62-07, 68R10, 62H25
Keywords: Geodesic Principal Componant Analysis; Wasserstein Space; Non-convex optimization (search for similar items in EconPapers)
Pages: 32 pages
Date: 2017-08-27
References: View complete reference list from CitEc
Citations: View citations in EconPapers (7)
Downloads: (external link)
http://crest.science/RePEc/wpstorage/2017-85.pdf CREST working paper version (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:crs:wpaper:2017-85
Access Statistics for this paper
More papers in Working Papers from Center for Research in Economics and Statistics Contact information at EDIRC.
Bibliographic data for series maintained by Secretariat General () and Murielle Jules Maintainer-Email : murielle.jules@ensae.Fr.