Mathematical bounds on r2 and the effect size in case-control genome-wide association studies
Sanjana M. Paye and
Michael D. Edge
Theoretical Population Biology, 2025, vol. 164, issue C, 1-11
Abstract:
Case-control genome-wide association studies (GWAS) are often used to find associations between genetic variants and diseases. When case-control GWAS are conducted, researchers must make decisions regarding how many cases and how many controls to include in the study. Connections between variants and diseases are made using association statistics, including χ2. Previous work in population genetics has shown that LD statistics, including r2, are bounded by the allele frequencies in the population being studied. Since varying the case fraction changes sample allele frequencies, we use the known bounds on r2 to explore how the fraction of cases included in a study can affect statistical power to detect associations. We analyze a simple mathematical model and use simulations to study a quantity proportional to the χ2 noncentrality parameter, which is closely related to r2, under various conditions. Varying the case fraction changes the χ2 noncentrality parameter, and by extension the statistical power, with effects depending on the dominance, penetrance, and frequency of the risk allele. Our framework explains previously observed results, such as asymmetries in power to detect risk vs. protective alleles, and the fact that a balanced sample of cases and controls does not always give the best power to detect associations, particularly for highly penetrant minor risk alleles that are either dominant or recessive. We show by simulation that our results can be used as a rough guide to statistical power for association tests other than χ2 tests of independence.
Keywords: Population genetics; Linkage disequilibrium; Genome-wide association studies; Case control studies; Mathematical bounds (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0040580925000280
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:thpobi:v:164:y:2025:i:c:p:1-11
DOI: 10.1016/j.tpb.2025.04.003
Access Statistics for this article
Theoretical Population Biology is currently edited by Jeremy Van Cleve
More articles in Theoretical Population Biology from Elsevier
Bibliographic data for series maintained by Catherine Liu ().