EconPapers    
Economics at your fingertips  
 

Detection of Soluble Solids Content (SSC) in Pears Using Near-Infrared Spectroscopy Combined with LASSO–GWF–PLS Model

Baishao Zhan, Peng Li, Ming Li, Wei Luo and Hailiang Zhang ()
Additional contact information
Baishao Zhan: College of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China
Peng Li: College of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China
Ming Li: College of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China
Wei Luo: College of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China
Hailiang Zhang: College of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China

Agriculture, 2023, vol. 13, issue 8, 1-15

Abstract: The soluble solids content (SSC) of pears is mainly composed of sugars, organic acids, and other soluble substances and is one of the important indices used to measure the sweetness and quality of pear juice. The SSC of pears is mainly composed of sugars, organic acids, amino acids, esters, alcohols, phenols, flavonoids, and other compounds, and different groups within these compounds have different characteristic absorption peaks corresponding to different characteristic wavelengths. Traditional methods such as genetic algorithm (GA) and competitive adaptive reweighted sampling (CARS) models used for screening characteristic wavelengths are mainly based on statistical methods, and characteristic wavelengths are selected by finding the wavelengths related to the changes in the concentration of the target analytes. By ignoring the molecular structure and chemical properties of the target analytes and disregarding the influence of the groups of the compounds in the target analytes on the spectral characteristics, wavelengths that are not related to the target analytes may be selected, thus affecting the accuracy of the analytical results. In this paper, a partial least squares (PLS) model was established based on the characteristic wavelengths of CARS, GA, and LASSO algorithms, and the best least absolute shrinkage and selection operator (LASSO) was selected and compared with the characteristic wavelengths selected by group weighted fusion (GWF). The LASSO regression was validated by 10-fold cross-validation to select the appropriate regularization parameter, and the 33 characteristic wavelengths correlated with the SSC of pears were selected in the full spectral range, and the 9 characteristic wavelengths corresponding to the group response were weighted and fused and input into the PLS regression model. Using an established model, the coefficient of determination ( R 2 ) and the root mean square error (RMSE) of the calibration set were 0.992 and 0.177%, respectively, and the R 2 and RMSE of the test set were 0.998 and 0.128%, respectively. The R 2 of our LASSO–GWF–PLS prediction model was improved from 0.975 to 0.998, indicating that the LASSO–GWF–PLS method has very good prediction ability for detection of SSC in pears.

Keywords: near-infrared spectroscopy; soluble solids content; pear; fusion variable selection algorithm; modeling (search for similar items in EconPapers)
JEL-codes: Q1 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2077-0472/13/8/1491/pdf (application/pdf)
https://www.mdpi.com/2077-0472/13/8/1491/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jagris:v:13:y:2023:i:8:p:1491-:d:1203638

Access Statistics for this article

Agriculture is currently edited by Ms. Leda Xuan

More articles in Agriculture from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-22
Handle: RePEc:gam:jagris:v:13:y:2023:i:8:p:1491-:d:1203638