Abstract
BACKGROUND: The dried ripe fruit or seed of Amomun tsaoko is a widely used spice and food additive in Eastern and Southeastern Asia. Approximately 90% of the global production of this spice occurs in Yunnan province, China. Over years of cultivation, genetic variations have emerged, leading to wide regional varieties. Authenticating geographical origin has become essential for quality assessment and control, as it directly influences a product's commercial value. OBJECTIVE: This study aims to authenticate the geographical origins of A. tsaoko seeds sourced from distinct and narrow geographical regions. METHODS: Near-infrared spectroscopy (NIRS) combined with machine learning (ML) techniques was used to determine the specific geographical origins of A. tsaoko seeds. RESULTS: The results demonstrated that Fourier transform Near-infrared spectroscopy (FT-NIR) followed by a multi-layer perceptron (MLP) was the optimal strategy among all methods tested. This approach achieved a high accuracy of 96.97%. Additionally, feature dimensionality reduction analysis was applied using the Catboost model. This analysis identified certain spectral ranges that contained important features for the model. CONCLUSION: This study indicates that pretreatment of NIRS raw data and the use of ML are potential strategies for rapid and specific geographic authentication of plants.