Interpretable Model Summaries Using the Wasserstein Distance

Eric Dunipace and Lorenzo Trippa

The American Statistician, 2025, vol. 79, issue 4, 421-434

Abstract: Large-parameter statistical and machine learning models are widely used in many fields but often lack interpretability. This limits the ability of practitioners to make informed decisions based on models; this can be especially challenging in multivariate analyses or Bayesian inference where there is a distribution of predictions to summarize. In response to these challenges, we propose a new method that uses the Wasserstein distance to find low-dimensional linear models that approximate the predictions of complex multivariate models, effectively summarizing them in a way that prioritizes the preservation of their predictive distributions. These summaries can facilitate the communication and understanding of complex models by practitioners in various fields and we provide diagnostic tools to assess their performance. We demonstrate our method on simulated data with different data generating processes and also apply our method to a Bayesian additive regression tree model that predicts survival time for glioblastoma multiforme (GBM) patients.
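The idea in the abstract, approximating a distribution of predictions with a low-dimensional linear model and scoring the fit by Wasserstein distance, can be illustrated with a toy sketch. This is not the authors' algorithm. It assumes simulated "posterior" prediction draws, a per-draw least-squares projection onto a chosen subset of covariates, and an average of per-observation one-dimensional 2-Wasserstein distances as the score. All function names (`wasserstein2_1d`, `linear_summary`, `summary_distance`) are hypothetical.

```python
import numpy as np

def wasserstein2_1d(a, b):
    """2-Wasserstein distance between two equal-size 1D empirical samples.

    For equally weighted samples of the same size, the optimal coupling
    matches sorted values, so the distance is an RMS of sorted differences.
    """
    a = np.sort(np.asarray(a, dtype=float))
    b = np.sort(np.asarray(b, dtype=float))
    if a.shape != b.shape:
        raise ValueError("samples must have the same length")
    return float(np.sqrt(np.mean((a - b) ** 2)))

def linear_summary(X, preds, cols):
    """Project each prediction draw onto the covariates X[:, cols].

    Assumption for illustration: an ordinary least-squares fit per draw.
    preds has shape (S, n): S draws of predictions for n observations.
    """
    Xs = X[:, cols]
    coef, *_ = np.linalg.lstsq(Xs, preds.T, rcond=None)  # shape (k, S)
    return (Xs @ coef).T  # shape (S, n)

def summary_distance(preds, approx):
    """Average, over observations, of the marginal 2-Wasserstein distance."""
    n = preds.shape[1]
    return float(np.mean([wasserstein2_1d(preds[:, i], approx[:, i])
                          for i in range(n)]))

# Simulated stand-in for a complex model's predictive draws (assumption):
# three covariates, of which the third has almost no effect.
rng = np.random.default_rng(0)
n, p, S = 50, 3, 200
X = rng.normal(size=(n, p))
beta_draws = np.array([2.0, -1.0, 0.0]) + 0.1 * rng.normal(size=(S, p))
preds = beta_draws @ X.T  # shape (S, n)

# Score summaries that use all, two, or one of the covariates.
dist_full = summary_distance(preds, linear_summary(X, preds, [0, 1, 2]))
dist_two = summary_distance(preds, linear_summary(X, preds, [0, 1]))
dist_one = summary_distance(preds, linear_summary(X, preds, [0]))
print(dist_full, dist_two, dist_one)
```

In this toy setting, dropping the near-irrelevant third covariate should cost little, while dropping the second should increase the distance noticeably, which is the kind of trade-off a low-dimensional summary is meant to expose.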

Date: 2025

Downloads: (external link)
http://hdl.handle.net/10.1080/00031305.2025.2551223 (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:taf:amstat:v:79:y:2025:i:4:p:421-434

Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/UTAS20

DOI: 10.1080/00031305.2025.2551223

The American Statistician is currently edited by Eric Sampson

More articles in The American Statistician from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst.

 
Page updated 2025-12-13
Handle: RePEc:taf:amstat:v:79:y:2025:i:4:p:421-434