Machine learning and optimization based decision-support tool for seed variety selection
Durai Sundaramoorthi () and
Lingxiu Dong
Additional contact information
Durai Sundaramoorthi: Washington University in St. Louis
Lingxiu Dong: Washington University in St. Louis
Annals of Operations Research, 2024, vol. 341, issue 1, No 2, 5-39
Abstract:
Abstract Every year agribusinesses develop and market new seed varieties with traits desirable for different planting environments. When agribusinesses experiment the new varieties at different farms, data is generated about the performance of these new seed varieties. However, farmers do not have a decision support tool to process the vast amount of yield performance data to make an informed seed variety selection decision for their farm. An informed decision requires accurate estimation of yield performances of seed varieties on the targeted farmland and balancing trade-offs between the expected yield and the risk associated with the seed varieties selected to grow. This research uses a real data set provided by Syngenta—an agribusiness—to create a decision-support tool. The data set used in this research contains yield information of different soybean varieties experimented at different farms located in the Midwest of the US, as well as information on location, soil, and weather conditions prevailing in those farms. In addition to this data, we also surveyed soybean farmers to understand their preferences and current practices in choosing seed varieties to grow in their farms. We are the first to capture and document farmers’ preferences and practices in selecting and growing soybean varieties. The data collected from the survey enabled us to compare the results emerging from the proposed methodology with the status quo practices. Using the Syngenta data and survey responses, this paper proposes an analytics framework that integrates machine learning, clustering, simulation, and portfolio optimization to optimally select soybean varieties to grow at the target farm. We choose a machine learning model, which simulates the yield performance of soybean varieties under different plausible weather scenarios derived from the neighborhood of the target farm. The simulated yields are then used to estimate parameters in a portfolio optimization formulation that selects the optimal portfolio of seed varieties to grow at the target farm. The main methodological contribution of this research is in the development of an approach that integrates machine learning, clustering, simulation, and portfolio optimization to help farmers make an important decision. Specifically, we introduce a novel data-driven simulation-based approach to estimate the parameters needed to solve a portfolio optimization problem. Our analysis indicates that an average farmer will gain as much as $177,369 per year in revenue by utilizing the analytics framework introduced in this research. The methodology developed in this research can be applied to variety selection decisions for other crops and influence farming practice positively. By embracing the machine learning and analytics powered framework introduced in this paper, agribusinesses can position themselves as the innovation leader and create business value by unleashing the potential of the scientific discoveries of agronomy to offer tailored farming decision support to individual farmers.
Keywords: Food shortage; Soybean; Data science; Portfolio optimization; Machine learning; Simulation and optimization (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10479-022-04995-8 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:annopr:v:341:y:2024:i:1:d:10.1007_s10479-022-04995-8
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10479
DOI: 10.1007/s10479-022-04995-8
Access Statistics for this article
Annals of Operations Research is currently edited by Endre Boros
More articles in Annals of Operations Research from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().