EconPapers    
Economics at your fingertips  
 

Genomic Prediction of Wheat Grain Yield Using Machine Learning

Manisha Sanjay Sirsat (), Paula Rodrigues Oblessuc and Ricardo S. Ramiro
Additional contact information
Manisha Sanjay Sirsat: Department of Data Management and Risk Analysis, InnovPlantProtect, 7350-478 Elvas, Portugal
Paula Rodrigues Oblessuc: Department of Protection of Specific Crops, InnovPlantProtect, 7350-478 Elvas, Portugal
Ricardo S. Ramiro: Department of Data Management and Risk Analysis, InnovPlantProtect, 7350-478 Elvas, Portugal

Agriculture, 2022, vol. 12, issue 9, 1-12

Abstract: Genomic Prediction (GP) is a powerful approach for inferring complex phenotypes from genetic markers. GP is critical for improving grain yield, particularly for staple crops such as wheat and rice, which are crucial to feeding the world. While machine learning (ML) models have recently started to be applied in GP, it is often unclear what are the best algorithms and how their results are affected by the feature selection (FS) methods. Here, we compared ML and deep learning (DL) algorithms with classical Bayesian approaches, across a range of different FS methods, for their performance in predicting wheat grain yield (in three datasets). Model performance was generally more affected by the prediction algorithm than the FS method. Among all models, the best performance was obtained for tree-based ML methods (random forests and gradient boosting) and for classical Bayesian methods. However, the latter was prone to fitting problems. This issue was also observed for models developed with features selected by BayesA, the only Bayesian FS method used here. Nonetheless, the three other FS methods led to models with no fitting problem but similar performance. Thus, our results indicate that the choice of prediction algorithm is more important than the choice of FS method for developing highly predictive models. Moreover, we concluded that random forests and gradient boosting algorithms generate highly predictive and robust wheat grain yield GP models.

Keywords: genomic prediction; machine learning; random forests; gradient boosting; Bayesian methods; penalized regression; deep learning (search for similar items in EconPapers)
JEL-codes: Q1 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2077-0472/12/9/1406/pdf (application/pdf)
https://www.mdpi.com/2077-0472/12/9/1406/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jagris:v:12:y:2022:i:9:p:1406-:d:908084

Access Statistics for this article

Agriculture is currently edited by Ms. Leda Xuan

More articles in Agriculture from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jagris:v:12:y:2022:i:9:p:1406-:d:908084