Non-parametric Bayesian updating and windowing with kernel density and the kudzu algorithm
Robert L. Grant
International Journal of Computational Economics and Econometrics, 2022, vol. 12, issue 4, 405-428
Abstract:
The concept of 'updating' parameter estimates and predictions as more data arrive is an important attraction for people adopting Bayesian methods, and essential in big data settings. Implementation via the hyperparameters of a joint prior distribution is challenging. This paper considers non-parametric updating, using a previous posterior sample as a new prior sample. Streaming data can be analysed in a moving window of time by subtracting old posterior sample(s) with appropriate weights. We evaluate three forms of kernel density, a sampling importance resampling implementation, and a novel algorithm called kudzu, which smooths density estimation trees. Methods are tested for distortion of illustrative prior distributions, long-run performance in a low-dimensional simulation study, and feasibility with a realistically large and fast dataset of taxi journeys. Kernel estimation appears to be useful in low-dimensional problems, and kudzu in high-dimensional problems, but careful tuning and monitoring is required. Areas for further research are outlined.
Keywords: Bayesian data analysis; big data; density estimation trees; kernel density estimation; non-parametric statistics; streaming data. (search for similar items in EconPapers)
Date: 2022
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.inderscience.com/link.php?id=126320 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijcome:v:12:y:2022:i:4:p:405-428
Access Statistics for this article
More articles in International Journal of Computational Economics and Econometrics from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().