Estimating the scaled mutation rate and mutation bias with site frequency data
Claus Vogl
Theoretical Population Biology, 2014, vol. 98, issue C, 19-27
Abstract:
The distribution of allele frequencies of a large number of biallelic sites is known as “allele-frequency spectrum†or “site-frequency spectrum†(SFS). Without selection and in regions of relatively high recombination rates, sites may be assumed to be independently and identically distributed. With a beta equilibrium distribution of allelic proportions and binomial sampling, a beta–binomial compound likelihood for each site results. The likelihood of the data and the posterior distribution of two parameters, scaled mutation rate θ and mutation bias α, is investigated in the general case and for small scaled mutation rates θ. In the general case, an expectation–maximization (EM) algorithm is derived to obtain maximum likelihood estimates of both parameters. With an appropriate prior distribution, a Markov chain Monte Carlo sampler to integrate the posterior distribution is also derived. As far as I am aware, previous maximum likelihood or Bayesian estimators of θ, explicitly or implicitly assume small scaled mutation rates, i.e., θ≪1. For θ≪1, maximum-likelihood estimators are also derived for both parameters using a Taylor series expansion of the beta–binomial distribution. The estimator of θ is a variant of the Ewens–Watterson estimator and of the maximum likelihood estimator derived with the Poisson Random Field approach. With a conjugate prior distribution, marginal and conditional beta posterior distributions are also derived for both parameters.
Keywords: Mutation–drift equilibrium; Beta–binomial; Stirling distribution; EM-algorithm; Markov chain Monte Carlo algorithm; Posterior (search for similar items in EconPapers)
Date: 2014
References: View references in EconPapers View complete reference list from CitEc 
Citations: View citations in EconPapers (6) 
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0040580914000793
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX 
RIS (EndNote, ProCite, RefMan) 
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:thpobi:v:98:y:2014:i:c:p:19-27
DOI: 10.1016/j.tpb.2014.10.002
Access Statistics for this article
Theoretical Population Biology is currently edited by Jeremy Van Cleve
More articles in Theoretical Population Biology  from  Elsevier
Bibliographic data for series maintained by Catherine Liu ().