Abstract
An origin discrimination model for rice from five production regions in Heilongjiang Province was constructed based on the combination of confocal microscopy Raman spectroscopy and chemometrics. A total of 150 field rice samples were collected from the Fangzheng, Chahayang, Jiansanjiang, Xiangshui, and Wuchang production areas. The optimal sample processing conditions, instrument parameter settings, and spectrum acquisition techniques were identified by investigating the influencing factor. The Raman spectra of milled rice within the range of 100-3200 cm(-1) were selected as the raw data, and the optimal preprocessing method combination consisting of normalization, Savitzky-Golay smoothing, and multivariate scatter correction was identified. Subsequently, the competitive adaptive reweighted sampling and discrete binary particle swarm optimization algorithms were employed to optimize the feature wavelength selection, resulting in the screening of 226 and 1899 feature wavelength variables, respectively. Using the full Raman spectrum data and feature wavelength data as inputs, partial least squares discriminant analysis, support vector machine and extreme learning machine origin discrimination models were constructed. The results indicated that the BPSO-GA-SVM model exhibited the best predictive ability, achieving a testing set accuracy of 86.67%.