Integrated Metabolomics-KPCA-Machine Learning framework: a solution for geographical traceability of Chinese Jujube

整合代谢组学-KPCA-机器学习框架:一种用于中国红枣地理溯源的解决方案

阅读:2

Abstract

Due to widespread product adulteration, Chinese jujube (CJ), a crop of global economic importance with nutritional and medicinal properties, struggles with geographical traceability. The study introduced a Metabolomics-Kernel Principal Component Analysis (KPCA)-Machine Learning (ML) framework to set up an origin identification system for CJ from six production regions in China (Xinjiang, Gansu, Shaanxi, Henan, Shandong, and Hebei). Using LC-MS/MS for untargeted metabolomics, researchers identified 312 metabolites. Multivariate analysis revealed 37 key discriminant variables (VIP > 1). KPCA compressed these features into 28 principal components (retaining 90.59 % information). Compared with the traditional method, the K-means clustering after dimensionality reduction of KPCA greatly improves the sample differentiation ability: the origin samples with original data overlap with fuzzy boundaries; while after dimensionality reduction, the six origin samples form a clear and compact cluster, which achieves accurate classification. This study pioneers a "Metabolomics-KPCA-ML" paradigm, offering a solution for traceability of geographical indication agricultural products.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。