Speeding up MCMC by Efficient Data Subsampling

Robert Kohn, Matias Quiroz, Minh-Ngoc Tran and Mattias Villani

Working Papers from University of Sydney Business School, Discipline of Business Analytics

Abstract: We propose Subsampling MCMC, a Markov Chain Monte Carlo (MCMC) framework where the likelihood function for n observations is estimated from a random subset of m observations. We introduce a general and highly efficient unbiased estimator of the log-likelihood based on control variates obtained from clustering the data. The cost of computing the log-likelihood estimator is much smaller than that of the full log-likelihood used by standard MCMC. The likelihood estimate is bias-corrected and used in two correlated pseudo-marginal algorithms to sample from a perturbed posterior, for which we derive the asymptotic error with respect to n and m, respectively. A practical estimator of the error is proposed and we show that the error is negligible even for a very small m in our applications. We demonstrate that Subsampling MCMC is substantially more efficient than standard MCMC in terms of sampling efficiency for a given computational budget, and that it outperforms other subsampling methods for MCMC proposed in the literature.
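The abstract does not state the estimator's formula, so the following is only a minimal sketch of the general idea: a difference estimator that sums cheap control variates over all n observations and corrects them with a random subsample of size m. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy N(theta, 1) model, the rounding-based clustering in the hypothetical helper `cluster_by_rounding`, sampling with replacement, and the correction that subtracts half the estimated variance.

```python
import numpy as np


def loglik_terms(y, theta):
    """Per-observation log-density of N(theta, 1); a toy model for illustration."""
    return -0.5 * (y - theta) ** 2 - 0.5 * np.log(2.0 * np.pi)


def cluster_by_rounding(y, decimals=2):
    """Hypothetical clustering: observations rounding to the same value share a cluster."""
    reps, labels, counts = np.unique(
        np.round(y, decimals), return_inverse=True, return_counts=True
    )
    return reps, labels, counts


def difference_estimator(y, theta, m, rng, reps, labels, counts):
    """Estimate sum_i l_i(theta) from m sampled observations plus control variates."""
    n = y.size
    # Control variate q_i: log-density at the representative of observation i's cluster.
    q_reps = loglik_terms(reps, theta)
    # Sum of q_i over all n observations, at a cost proportional to the number of clusters.
    q_sum = np.dot(counts, q_reps)
    # Simple random sampling with replacement (an assumption of this sketch).
    idx = rng.integers(0, n, size=m)
    d = loglik_terms(y[idx], theta) - q_reps[labels[idx]]
    l_hat = q_sum + n * d.mean()
    # Estimated variance of l_hat.
    var_hat = n ** 2 * d.var(ddof=1) / m
    return l_hat, var_hat


rng = np.random.default_rng(0)
y = rng.normal(1.0, 1.0, size=10_000)
reps, labels, counts = cluster_by_rounding(y)

theta = 0.8
full = loglik_terms(y, theta).sum()
l_hat, var_hat = difference_estimator(y, theta, 100, rng, reps, labels, counts)
# Assumed form of bias correction for the likelihood estimate exp(l_hat).
l_corrected = l_hat - 0.5 * var_hat
print(full, l_hat, var_hat, l_corrected)
```

The closer the control variates track the individual log-density terms, the smaller the differences `d` and the lower the variance of the estimate for a given m, which is why a small subsample can suffice in this kind of scheme.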

Keywords: Survey sampling; Big Data; Block pseudo-marginal; Estimated likelihood; Correlated pseudo-marginal; Bayesian inference
Date: 2016
New Economics Papers: this item is included in nep-ecm

Downloads: (external link)
http://hdl.handle.net/2123/16205

Related works:
Journal Article: Speeding Up MCMC by Efficient Data Subsampling (2019)
Working Paper: SPEEDING UP MCMC BY EFFICIENT DATA SUBSAMPLING (2015)

Persistent link: https://EconPapers.repec.org/RePEc:syb:wpbsba:2123/16205

Bibliographic data for series maintained by Artem Prokhorov.

Handle: RePEc:syb:wpbsba:2123/16205