EconPapers    
Economics at your fingertips  
 

Sensitivity to prior specification in Bayesian genome-based prediction models

Lehermeier Christina, Wimmer Valentin, Albrecht Theresa, Auinger Hans-Jürgen, Gianola Daniel, Schmid Volker J. and Schön Chris-Carolin ()
Additional contact information
Lehermeier Christina: Plant Breeding, Technische Universität München, Emil-Ramann-Straße 4, 85354 Freising, Germany
Wimmer Valentin: Plant Breeding, Technische Universität München, Emil-Ramann-Straße 4, 85354 Freising, Germany
Albrecht Theresa: Plant Breeding, Technische Universität München, Emil-Ramann-Straße 4, 85354 Freising, Germany Current address: Institute for Crop Production and Plant Breeding, Bavarian State Research Center for Agriculture, Am Gereuth 6, 85354 Freising, Germany
Auinger Hans-Jürgen: Plant Breeding, Technische Universität München, Emil-Ramann-Straße 4, 85354 Freising, Germany
Gianola Daniel: Department of Animal Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA; and Institute for Advanced Study, Technische Universität München, Lichtenbergstraße 2a, 85748 Garching, Germany
Schmid Volker J.: Department of Statistics, Ludwig-Maximilians-Universität München, Ludwigstraße 33, 80539 München, Germany
Schön Chris-Carolin: Plant Breeding, Technische Universität München, Emil-Ramann-Straße 4, 85354 Freising, Germany

Statistical Applications in Genetics and Molecular Biology, 2013, vol. 12, issue 3, 375-391

Abstract: Different statistical models have been proposed for maximizing prediction accuracy in genome-based prediction of breeding values in plant and animal breeding. However, little is known about the sensitivity of these models with respect to prior and hyperparameter specification, because comparisons of prediction performance are mainly based on a single set of hyperparameters. In this study, we focused on Bayesian prediction methods using a standard linear regression model with marker covariates coding additive effects at a large number of marker loci. By comparing different hyperparameter settings, we investigated the sensitivity of four methods frequently used in genome-based prediction (Bayesian Ridge, Bayesian Lasso, BayesA and BayesB) to specification of the prior distribution of marker effects. We used datasets simulated according to a typical maize breeding program differing in the number of markers and the number of simulated quantitative trait loci affecting the trait. Furthermore, we used an experimental maize dataset, comprising 698 doubled haploid lines, each genotyped with 56110 single nucleotide polymorphism markers and phenotyped as testcrosses for the two quantitative traits grain dry matter yield and grain dry matter content. The predictive ability of the different models was assessed by five-fold cross-validation. The extent of Bayesian learning was quantified by calculation of the Hellinger distance between the prior and posterior densities of marker effects. Our results indicate that similar predictive abilities can be achieved with all methods, but with BayesA and BayesB hyperparameter settings had a stronger effect on prediction performance than with the other two methods. Prediction performance of BayesA and BayesB suffered substantially from a non-optimal choice of hyperparameters.

Keywords: genome-based prediction; genomic selection; Bayesian learning; shrinkage prior; plant breeding (search for similar items in EconPapers)
Date: 2013
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/sagmb-2012-0042 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:12:y:2013:i:3:p:375-391:n:1006

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html

DOI: 10.1515/sagmb-2012-0042

Access Statistics for this article

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-03-19
Handle: RePEc:bpj:sagmbi:v:12:y:2013:i:3:p:375-391:n:1006