EconPapers    
Economics at your fingertips  
 

Prediction of Solid Soluble Content of Green Plum Based on Improved CatBoost

Xiao Zhang, Chenxin Zhou, Qi Sun, Ying Liu (), Yutu Yang and Zilong Zhuang
Additional contact information
Xiao Zhang: College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China
Chenxin Zhou: College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China
Qi Sun: College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China
Ying Liu: College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China
Yutu Yang: College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China
Zilong Zhuang: College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China

Agriculture, 2023, vol. 13, issue 6, 1-12

Abstract: Most green plums need to be processed before consumption, and due to personal subjective factors, manual harvesting and sorting are difficult to achieve using standardized processing. Soluble solid content (SSC) of green plum was taken as the research object in this paper. Visible near-infrared (VIS-NIR) and shortwave near-infrared (SW-NIR) full-spectrum spectral information of green plums were collected, and the spectral data were corrected and pre-processed. Random forest algorithm based on induced random selection (IRS-RF) was proposed to screen four sets of characteristic wavebands. Bayesian optimization CatBoost model (BO-CatBoost) was constructed to predict SSC value of green plums. The experimental results showed that the preprocessing method of multiplicative scatter corrections (MSC) was obviously superior to Savitzky–Golay (S–G), the prediction effect of SSC based on VIS-NIR spectral waveband by partial least squares regression model (PLSR) was obviously superior to SW-NIR spectral waveband, MSC + IRS-RF was obviously superior to corresponding combination of correlation coefficient method (CCM), successive projections algorithm (SPA), competitive adaptive reweighted sampling (CARS), and random forest (RF). With the lowest dimensional selected feature waveband, the lowest VIS-NIR band group was only 53, and the SW-NIR band group was only 100. The model proposed in this paper based on MSC + IRS-RF + BO-CatBoost was superior to PLSR, XGBoost, and CatBoost in predicting SSC, with R 2 P of 0.957, which was 3.1% higher than the traditional PLSR.

Keywords: green plum; spectral technique; SSC; BO-CatBoost; feature band groups (search for similar items in EconPapers)
JEL-codes: Q1 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2077-0472/13/6/1122/pdf (application/pdf)
https://www.mdpi.com/2077-0472/13/6/1122/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jagris:v:13:y:2023:i:6:p:1122-:d:1156024

Access Statistics for this article

Agriculture is currently edited by Ms. Leda Xuan

More articles in Agriculture from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jagris:v:13:y:2023:i:6:p:1122-:d:1156024