Multi-omics prediction of terpene constituents and phenolic traits in Eucalyptus globulus using Bayesian models and tree-based machine learning

利用贝叶斯模型和基于树的机器学习方法对蓝桉(Eucalyptus globulus)中的萜烯成分和酚类性状进行多组学预测

阅读:1

Abstract

BACKGROUND: Terpenes and phenolic compounds are multifunctional plant metabolites that contribute to defense, signaling, and ecological interactions, while supporting industrial applications. Here, we implemented an integrative multi-omics framework to dissect the genetic and metabolic architecture of biobased compound production in a Eucalyptus globulus breeding population, a tree species of global economic importance. The dataset comprised 14,442 high-confidence SNPs genotyped using the EUChip60K array and 3,279 haplotype blocks, complemented by phenomic datasets derived from near-infrared (NIR) spectral absorbance (~ 900–2,500 nm) and pigment-related indices (chlorophyll, anthocyanins, flavonols, and nitrogen balance index). RESULTS: Gas chromatography-mass spectrometry (GC–MS) and ultra-high-performance liquid chromatography-quadrupole-time-of-flight tandem MS (UHPLC–QqTOF–MS2) revealed substantial inter-individual variation: essential oils were dominated by 1,8-cineole (42.70% ± 7.19%) and α-pinene (38.02% ± 5.20%), while phenolic extracts were enriched in phloroglucinol derivatives (92.67% ± 1.59%), notably the macrocarpal C isomer (12.98% ± 1.48%). Total additive heritability integrating SNP- and pedigree-based effects was high for MeOH extract yield (h(2) = 0.721), total phenolics (h(2) = 0.646), α-terpineol (h(2) = 0.718), and 1,8-cineole (h(2) = 0.658), indicating strong genetic control. Predictive modeling revealed that haplotype-based Bayesian approaches outperformed SNP- and phenomic-based models, with the highest accuracies for α-terpineol (prediction accuracy = 0.717), α-pinene (prediction accuracy = 0.554), and methanolic extract yield (prediction accuracy = 0.624). Feature selection identified pleiotropic markers (e.g., SNP12778, HAP205, HAP272, NIR2429) co-localized with genes involved in photosynthesis, signaling, and metabolic regulation. CONCLUSION: These findings lay the foundation for omics-assisted E. globulus breeding aimed at generating elite genotypes for large-scale production of high-value bioactive compounds with practical applications in health, agriculture, and bioenergy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12870-026-08581-z.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。