Multivariate Goodness-of-Fit Tests Based on Wasserstein Distance
Marc Hallin,
Gilles Mordant and
Johan Segers ()
Additional contact information
Gilles Mordant: Université catholique de Louvain, LIDAM/ISBA, Belgium
Johan Segers: Université catholique de Louvain, LIDAM/ISBA, Belgium
No 2021005, LIDAM Reprints ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA)
Abstract:
Goodness-of-fit tests based on the empirical Wasserstein distance are proposed for simple and composite null hypotheses involving general multivariate distributions. For group families, the procedure is to be implemented after preliminary reduction of the data via invariance. This property allows for calculation of exact critical values and p-values at finite sample sizes. Applications include testing for location–scale families and testing for families arising from affine transformations, such as elliptical distributions with given standard radial density and unspecified location vector and scatter matrix. A novel test for multivariate normality with unspecified mean vector and covariance matrix arises as a special case. For more general parametric families, we propose a parametric bootstrap procedure to calculate critical values. The lack of asymptotic distribution theory for the empirical Wasserstein distance means that the validity of the parametric bootstrap under the null hypothesis remains a conjecture. Nevertheless, we show that the test is consistent against fixed alternatives. To this end, we prove a uniform law of large numbers for the empirical distribution in Wasserstein distance, where the uniformity is over any class of underlying distributions satisfying a uniform integrability condition but no additional moment assumptions. The calculation of test statistics boils down to solving the well-studied semi-discrete optimal transport problem. Extensive numerical experiments demonstrate the practical feasibility and the excellent performance of the proposed tests for the Wasserstein distance of order p = 1 and p = 2 and for dimensions at least up to d = 5. The simulations also lend support to the conjecture of the asymptotic validity of the parametric bootstrap.
Keywords: Copula; Elliptical distribution; Goodness-of-fit; Group families; Multivariate normality; Optimal transport; Semi-discrete problem; Skew-t distribution; Wasserstein distance (search for similar items in EconPapers)
Date: 2021-01-01
Note: In: Electronic Journal of Statistics, Vol. 15, no. 1, p. 1328-1371 (2021)
References: Add references at CitEc
Citations: View citations in EconPapers (10)
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
Working Paper: Multivariate Goodness-of-Fit Tests Based on Wasserstein Distance (2020) 
Working Paper: Multivariate Goodness-of-Fit Tests Based on Wasserstein Distance (2020) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:aiz:louvar:2021005
DOI: 10.1214/21-EJS1816
Access Statistics for this paper
More papers in LIDAM Reprints ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA) Voie du Roman Pays 20, 1348 Louvain-la-Neuve (Belgium). Contact information at EDIRC.
Bibliographic data for series maintained by Nadja Peiffer ().