Seed quality drives grain yield in Ethiopian and Senegalese sorghum: Insights from machine learning
Ezekiel Ahn,
Louis K Prom,
Jae Hee Jang,
Insuck Baek,
Adama R Tukuli,
Seunghyun Lim,
Seok Min Hong,
Moon S Kim,
Lyndel W Meinhardt,
Sunchung Park and
Clint Magill
PLOS ONE, 2025, vol. 20, issue 8, 1-21
Abstract:
Accurately predicting grain yield remains a major challenge in sorghum breeding, particularly across genetically and geographically diverse germplasm. To address this, we applied a phenotype-informed machine learning (PIML) framework to analyze nine phenotypic traits in 179 Ethiopian and Senegalese accessions. Using hierarchical clustering and oversampling with ADASYN, we achieved high classification accuracy (0.99) for phenotypic group assignment. Grain yield prediction was most effective with a Neural Boosted model (NTanH(3)NBoost(8)), achieving a mean R2 of 0.36 and RASE (equivalent to RMSE) of 4.87. Feature importance analysis consistently identified seed weight and germination rate as the strongest predictors of grain yield, while disease resistance traits showed limited predictive value. These findings suggest that early selection based on seed quality traits may provide a practical strategy for improving sorghum yield under field conditions, especially in resource-limited environments.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0329366 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 29366&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0329366
DOI: 10.1371/journal.pone.0329366
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().