Evaluation of Designs and Estimation Methods Under Response-Dependent Two-Phase Sampling for Genetic Association Studies

Brady Ryan, Ananthika Nirmalkanna, Candemir Cigsar and Yildiz E. Yilmaz
Additional contact information
Brady Ryan: Memorial University of Newfoundland
Ananthika Nirmalkanna: Memorial University of Newfoundland
Candemir Cigsar: Memorial University of Newfoundland
Yildiz E. Yilmaz: Memorial University of Newfoundland

Statistics in Biosciences, 2023, vol. 15, issue 2, No 12, 510-539

Abstract: In many genetic association analyses, while the aim is to identify genetic variants associated with a given quantitative trait, budgetary constraints prevent genotyping all individuals in a cohort. Selection of individuals for genotyping according to their quantitative trait value can improve cost efficiency. We consider quantitative trait-dependent two-phase sampling designs. In the first phase, trait and inexpensive covariate values for all individuals in a cohort are obtained; in the second phase, genetic sequence data for a subset of individuals are obtained according to their trait values and possibly their inexpensive covariates. We consider the likelihood and pseudo-likelihood methods proposed to analyze response-biased samples, assess their performance under common, low-frequency, and rare variant analyses, compare their efficiencies and investigate efficient response-dependent sampling designs under each method. We also assess robustness of the estimation methods and sampling designs under misspecified models. The results show that extreme sampling is the most efficient design for common variant analysis, and that selecting a small sample from the middle stratum improves accuracy and precision in low-frequency and rare variant analyses. Likelihood methods under an extreme sampling design generally give the most accurate and precise estimates when the model is correctly specified. Both the estimated pseudo-likelihood and pseudo-conditional likelihood methods become more efficient under model misspecification.
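
The second-phase selection described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the article: the function name `two_phase_select`, the cut-points at the 10th and 90th percentiles of the trait, and the stratum sample sizes. The sketch covers only the choice of individuals to genotype (extreme strata plus a small middle-stratum sample); it does not implement the likelihood or pseudo-likelihood estimation methods.

```python
import numpy as np

def two_phase_select(trait, n_lower, n_middle, n_upper,
                     lower_q=0.1, upper_q=0.9, seed=0):
    """Pick phase-two individuals by trait stratum (hypothetical helper).

    Cut-points and stratum sizes are illustrative assumptions,
    not values from the article.
    """
    rng = np.random.default_rng(seed)
    trait = np.asarray(trait, dtype=float)
    lo_cut, hi_cut = np.quantile(trait, [lower_q, upper_q])

    # Phase one: every individual has a trait value, so strata are known.
    strata = {
        "lower": np.flatnonzero(trait < lo_cut),
        "middle": np.flatnonzero((trait >= lo_cut) & (trait <= hi_cut)),
        "upper": np.flatnonzero(trait > hi_cut),
    }
    sizes = {"lower": n_lower, "middle": n_middle, "upper": n_upper}

    # Phase two: simple random sample without replacement in each stratum.
    chosen = []
    for name, idx in strata.items():
        k = min(sizes[name], idx.size)
        chosen.append(rng.choice(idx, size=k, replace=False))
    return np.sort(np.concatenate(chosen)), strata

# Simulated cohort of 1000 trait values (illustrative data only).
cohort_rng = np.random.default_rng(1)
trait = cohort_rng.normal(size=1000)
selected, strata = two_phase_select(trait, n_lower=100, n_middle=20, n_upper=100)
print(len(selected), "individuals selected for genotyping")
```

Setting `n_middle=0` gives a pure extreme sampling design; a small positive `n_middle` corresponds to the middle-stratum sample that the abstract reports as helpful for low-frequency and rare variants.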

Keywords: Likelihood-based methods; Pseudo-conditional likelihood; Estimated pseudo-likelihood; Extreme sampling; Rare variant
Date: 2023

Downloads: (external link)
http://link.springer.com/10.1007/s12561-023-09369-7 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:spr:stabio:v:15:y:2023:i:2:d:10.1007_s12561-023-09369-7

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/12561

DOI: 10.1007/s12561-023-09369-7

Statistics in Biosciences is currently edited by Hongyu Zhao and Xihong Lin

More articles in Statistics in Biosciences from Springer, International Chinese Statistical Association
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-03-20
Handle: RePEc:spr:stabio:v:15:y:2023:i:2:d:10.1007_s12561-023-09369-7