Variances and covariances of linear summary statistics of segregating sites
Yun-Xin Fu
Theoretical Population Biology, 2022, vol. 145, issue C, 95-108
Abstract:
Each mutation in a population sample of DNA sequences can be classified by the number of sequences that inherit the mutant nucleotide, the resulting frequencies are known as mutations of different sizes or site frequency spectrum. Many summary statistics can be defined as a linear function of these frequencies. A flexible class of such linear summary statistics is explored analytically in this paper which include several well-known quantities, such as the number of segregating sizes and the mean number of nucleotide differences between two sequences. Some asymptotic variances and covariances are obtained while the analytical formulas for the variances and covariances of nine such linear summary statistics are derived, most of which are unknown to date. This study not only provides some theoretical foundations for exploring linear summary statistics, but also provides some newlinear summary statistics that may be utilized for analyzing sample polymorphism. Furthermore it is showed that a newly developed linear summary statistics has a smaller variance almost uniformly than Watterson’s estimator, and that a class of linear summary statistics given too heavy weights on mutations of smaller sizes result in asymptotically non-zero variance.
Keywords: Coalescent; Segregating sites; Mutation size; Linear summary statistics; Î -statistics; Variance and covariance (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0040580922000284
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:thpobi:v:145:y:2022:i:c:p:95-108
DOI: 10.1016/j.tpb.2022.03.005
Access Statistics for this article
Theoretical Population Biology is currently edited by Jeremy Van Cleve
More articles in Theoretical Population Biology from Elsevier
Bibliographic data for series maintained by Catherine Liu ().