Semi-automatic selection of summary statistics for ABC model choice
Prangle Dennis (),
Fearnhead Paul,
Cox Murray P.,
Biggs Patrick J. and
French Nigel P.
Additional contact information
Prangle Dennis: Department of Mathematics and Statistics, Lancaster University, UK
Fearnhead Paul: Department of Mathematics and Statistics, Lancaster University, UK
Cox Murray P.: Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand Institute of Fundamental Sciences, Massey University, Palmerston North, New Zealand
Biggs Patrick J.: Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand
French Nigel P.: Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand Infectious Disease Research Centre, Institute of Veterinary, Animal and Biomedical Sciences, Massey University, Palmerston North, New Zealand
Statistical Applications in Genetics and Molecular Biology, 2014, vol. 13, issue 1, 67-82
Abstract:
A central statistical goal is to choose between alternative explanatory models of data. In many modern applications, such as population genetics, it is not possible to apply standard methods based on evaluating the likelihood functions of the models, as these are numerically intractable. Approximate Bayesian computation (ABC) is a commonly used alternative for such situations. ABC simulates data x for many parameter values under each model, which is compared to the observed data xobs. More weight is placed on models under which S(x) is close to S(xobs), where S maps data to a vector of summary statistics. Previous work has shown the choice of S is crucial to the efficiency and accuracy of ABC. This paper provides a method to select good summary statistics for model choice. It uses a preliminary step, simulating many x values from all models and fitting regressions to this with the model as response. The resulting model weight estimators are used as S in an ABC analysis. Theoretical results are given to justify this as approximating low dimensional sufficient statistics. A substantive application is presented: choosing between competing coalescent models of demographic growth for Campylobacter jejuni in New Zealand using multi-locus sequence typing data.
Keywords: ABC; model selection; sufficiency; Campylobacter; MLST; coalescent (search for similar items in EconPapers)
Date: 2014
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (5)
Downloads: (external link)
https://doi.org/10.1515/sagmb-2013-0012 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:13:y:2014:i:1:p:67-82:n:5
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.1515/sagmb-2013-0012
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().