EconPapers    
Economics at your fingertips  
 

Robustness of the linear mixed effects model to error distribution assumptions and the consequences for genome-wide association studies

Warrington Nicole M. (), Tilling Kate, Howe Laura D., Paternoster Lavinia, Pennell Craig E., Wu Yan Yan and Briollais Laurent
Additional contact information
Warrington Nicole M.: School of Women’s and Infants’ Health, The University of Western Australia, Perth, Western Australia, Australia University of Queensland Diamantina Institute, Translational Research Institute, Brisbane, Queensland, Australia
Tilling Kate: School of Social and Community Medicine, University of Bristol, Bristol, UK MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
Howe Laura D.: School of Social and Community Medicine, University of Bristol, Bristol, UK MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
Paternoster Lavinia: School of Social and Community Medicine, University of Bristol, Bristol, UK MRC Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
Pennell Craig E.: School of Women’s and Infants’ Health, The University of Western Australia, Perth, Western Australia, Australia
Wu Yan Yan: Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
Briollais Laurent: Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada

Statistical Applications in Genetics and Molecular Biology, 2014, vol. 13, issue 5, 567-587

Abstract: Genome-wide association studies have been successful in uncovering novel genetic variants that are associated with disease status or cross-sectional phenotypic traits. Researchers are beginning to investigate how genes play a role in the development of a trait over time. Linear mixed effects models (LMM) are commonly used to model longitudinal data; however, it is unclear if the failure to meet the models distributional assumptions will affect the conclusions when conducting a genome-wide association study. In an extensive simulation study, we compare coverage probabilities, bias, type 1 error rates and statistical power when the error of the LMM is either heteroscedastic or has a non-Gaussian distribution. We conclude that the model is robust to misspecification if the same function of age is included in the fixed and random effects. However, type 1 error of the genetic effect over time is inflated, regardless of the model misspecification, if the polynomial function for age in the fixed and random effects differs. In situations where the model will not converge with a high order polynomial function in the random effects, a reduced function can be used but a robust standard error needs to be calculated to avoid inflation of the type 1 error. As an illustration, a LMM was applied to longitudinal body mass index (BMI) data over childhood in the ALSPAC cohort; the results emphasised the need for the robust standard error to ensure correct inference of associations of longitudinal BMI with chromosome 16 single nucleotide polymorphisms.

Keywords: ALSPAC; genome-wide association; longitudinal studies; misspecificiation; mixed model; robustness (search for similar items in EconPapers)
Date: 2014
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/sagmb-2013-0066 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:13:y:2014:i:5:p:21:n:4

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html

DOI: 10.1515/sagmb-2013-0066

Access Statistics for this article

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-03-19
Handle: RePEc:bpj:sagmbi:v:13:y:2014:i:5:p:21:n:4