A Novel Approach to Pine Nut Classification: Combining Near-Infrared Spectroscopy and Image Shape Features with Soft Voting-Based Ensemble Learning
Yueyun Yu,
Xin Huang,
Danjv Lv,
Benjamin K. Ng () and
Chan-Tong Lam
Additional contact information
Yueyun Yu: Faculty of Applied Sciences, Macao Polytechnic University, Macao, China
Xin Huang: College of Big Data and Intelligent Engineering, Southwest Forestry University, Kunming 650224, China
Danjv Lv: College of Big Data and Intelligent Engineering, Southwest Forestry University, Kunming 650224, China
Benjamin K. Ng: Faculty of Applied Sciences, Macao Polytechnic University, Macao, China
Chan-Tong Lam: Faculty of Applied Sciences, Macao Polytechnic University, Macao, China
Mathematics, 2025, vol. 13, issue 12, 1-21
Abstract:
Pine nuts hold significant economic value due to their rich plant protein and healthy fats, yet precise variety classification has long been hindered by limitations of traditional techniques such as chemical analysis and machine vision. This study proposes a novel near-infrared (NIR) spectral feature selection algorithm, termed the improved binary equilibrium optimizer with selection probability (IBiEO-SP), which incorporates a dynamic probability adjustment mechanism to achieve efficient feature dimensionality reduction. Experimental validation on a dataset comprising seven pine nut varieties demonstrated that, compared to particle swarm optimization (PSO) and the genetic algorithm (GA), the IBiEO-SP algorithm improved average classification accuracy by 5.7% ( p < 0.01, Student’s t -test) under four spectral preprocessing methods (MSC, SNV, SG1, and SG2). Remarkably, only 2–3 features were required to achieve optimal performance (MSC + random forest: 99.05% accuracy, 100% F1/precision; SNV + KNN: 97.14% accuracy, 100% F1/precision). Furthermore, a multimodal data synergy strategy integrating NIR spectroscopy with morphological features was proposed, and a classification model was constructed using a soft voting ensemble. The final classification accuracy reached 99.95%, representing a 2.9% improvement over single-spectral-mode analysis. The results indicate that the IBiEO-SP algorithm effectively balances feature discriminative power and model generalization needs, overcoming the contradiction between high-dimensional data redundancy and low-dimensional information loss. This work provides a high-precision, low-complexity solution for rapid quality detection of pine nuts, with broad implications for agricultural product inspection and food safety.
Keywords: IBiEO-SP; pine nut; near-infrared spectroscopy; feature selection; ensemble learning (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/13/12/2009/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/12/2009/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:12:p:2009-:d:1681908
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().