Economics at your fingertips  

Multi-locus data distinguishes between population growth and multiple merger coalescents

Koskela Jere
Additional contact information
Koskela Jere: Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK

Statistical Applications in Genetics and Molecular Biology, 2018, vol. 17, issue 3, 21

Abstract: We introduce a low dimensional function of the site frequency spectrum that is tailor-made for distinguishing coalescent models with multiple mergers from Kingman coalescent models with population growth, and use this function to construct a hypothesis test between these model classes. The null and alternative sampling distributions of the statistic are intractable, but its low dimensionality renders them amenable to Monte Carlo estimation. We construct kernel density estimates of the sampling distributions based on simulated data, and show that the resulting hypothesis test dramatically improves on the statistical power of a current state-of-the-art method. A key reason for this improvement is the use of multi-locus data, in particular averaging observed site frequency spectra across unlinked loci to reduce sampling variance. We also demonstrate the robustness of our method to nuisance and tuning parameters. Finally we show that the same kernel density estimates can be used to conduct parameter estimation, and argue that our method is readily generalisable for applications in model selection, parameter inference and experimental design.

Keywords: kernel density estimation; multiple merger coalescent; population growth; site frequency spectrum; statistical power; Primary: 92D10; Secondary: 62M02; 62F03 (search for similar items in EconPapers)
Date: 2018
References: Add references at CitEc
Citations: View citations in EconPapers (1) Track citations by RSS feed

Downloads: (external link) (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link:

Ordering information: This journal article can be ordered from

DOI: 10.1515/sagmb-2017-0011

Access Statistics for this article

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

Page updated 2021-05-07
Handle: RePEc:bpj:sagmbi:v:17:y:2018:i:3:p:21:n:2