EconPapers    
Economics at your fingertips  
 

A Rapid Identification Method for Cottonseed Varieties Based on Near-Infrared Spectral and Generative Adversarial Networks

Qingxu Li, Hao Li (), Renhao Liu, Xiaofeng Dong, Hongzhou Zhang and Wanhuai Zhou
Additional contact information
Qingxu Li: College of Computer Science, Anhui University of Finance & Economics, Bengbu 233030, China
Hao Li: College of Computer Science, Anhui University of Finance & Economics, Bengbu 233030, China
Renhao Liu: College of Mechanical and Electrical Engineering, Tarim University, Alar 843300, China
Xiaofeng Dong: College of Mechanical and Electrical Engineering, Tarim University, Alar 843300, China
Hongzhou Zhang: College of Mechanical and Electrical Engineering, Tarim University, Alar 843300, China
Wanhuai Zhou: College of Computer Science, Anhui University of Finance & Economics, Bengbu 233030, China

Agriculture, 2024, vol. 14, issue 12, 1-21

Abstract: China is a major cotton-growing country with numerous cotton varieties, each exhibiting significant differences in yield and fiber quality. However, the current management of cottonseed varieties is disorganized, resulting in severe homogenization and the presence of counterfeit and mislabeled varieties. The detection of cottonseed variety information has become a critical issue for the Chinese cotton industry. In this study, we collected near-infrared (NIR) spectral data from six cottonseed varieties and constructed a GAN for cottonseed NIR data (GAN-CNIRD) model to generate additional cottonseed NIR data. The Euclidean distance method was used to label the generated NIR data according to the characteristics of the true NIR data. We then applied Standard Normal Variate (SNV), Multiplicative Scatter Correction (MSC), and Normalization algorithms to preprocess the combined dataset of generated and real cottonseed NIR data. Feature wavelengths were extracted using Bootstrap Soft Shrinkage (BOSS) and Competitive Adaptive Reweighted Sampling (CARS) algorithms. Subsequently, we developed Linear Discriminant Analysis (LDA), Random subspace method (RSM), and convolutional neural network (CNN) models to classify the cottonseed varieties. The results showed that for the LDA model, the use of feature wavelengths extracted after Normalization-BOSS processing achieved the best performance with an accuracy of 97.00%. For the RSM model, the use of feature wavelengths extracted after SNV-CARS processing achieved the best performance with an accuracy of 98.00%. For the CNN model, the use of feature wavelengths extracted after MSC-CARS processing achieved the best performance with an accuracy of 100.00%. Data augmentation using GAN-CNIRD-generated cottonseed data improved the accuracy of the three optimal models by 6%, 5%, and 6%, respectively. This study provides a crucial reference for the rapid detection of cottonseed variety information and has significant implications for the standardized management of cottonseed varieties.

Keywords: cottonseeds; cottonseed varieties; machine learning; near-infrared spectroscopy; generative adversarial networks (search for similar items in EconPapers)
JEL-codes: Q1 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2077-0472/14/12/2177/pdf (application/pdf)
https://www.mdpi.com/2077-0472/14/12/2177/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jagris:v:14:y:2024:i:12:p:2177-:d:1532847

Access Statistics for this article

Agriculture is currently edited by Ms. Leda Xuan

More articles in Agriculture from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jagris:v:14:y:2024:i:12:p:2177-:d:1532847