Whole Genome Resequencing of 205 Avocado Trees Unveils the Genomic Patterns of Racial Divergence in the Americas

Abstract

Avocado (Persea americana Mill.) is one of the most widely consumed fruits worldwide. The species is traditionally classified into three botanical races: Mexican, Guatemalan, and West Indian (with a potentially distinct Colombian genepool). However, previous studies using molecular markers, such as AFLPs, microsatellites (SSRs), and GBS-derived SNPs, have only partially resolved this racial divergence, especially in the hyper-agrobiodiverse region of northwest South America. Therefore, to confirm the genetic identity and origin of "criollo" avocado cultivars in the region, and to improve their traceability as rootstocks for the Hass variety, we performed low-coverage whole genome resequencing (lcWGS) on 205 ex situ conserved tree samples, comprising 42 commercial varieties and 163 "criollo" trees from various provinces in Colombia. This characterization yielded a total of 64,310,961 SNPs at an average coverage of 4.69×. Population structure analysis using principal component analysis (PCA) and ADMIXTURE recovered at least five genetic clusters (K = 5), partly confirmed by Bayesian phylogenetic inference. Three clusters matched the recognized Mesoamerican botanical races (Mexican, Guatemalan, and West Indian), and two clusters reinforced the distinctness of two novel Colombian genetic groups, one Andean and one Caribbean. Finally, to retrieve high-quality SNP markers for racial screening, a second genomic dataset was filtered. It consisted of 68 avocado tree samples exhibiting more than 80% ancestry from a single racial cluster, and 9826 SNPs with a minor allele frequency (MAF) of at least 5%, a minimum sequencing depth (SD) of 10× per position, and missing data per variant not exceeding 20% (i.e., variants with genotypes present in at least 80% of the samples). This racially segregating high-quality subset was analyzed against the racial substructure using linear mixed models (LMMs), enabling the identification of 254 SNP markers associated with the five avocado genetic races. These candidate SNPs may be leveraged by nurseries and producers through a high-throughput SNP screening system for the racial traceability of seedling donor trees, saplings, and rootstocks. These genomic resources will support the selection of regionally adapted elite rootstocks and represent a landmark in Colombian horticulture as the first large-scale lcWGS-based characterization of a local avocado germplasm collection.
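The filtering thresholds reported above (ancestry above 80%, MAF of at least 5%, depth of at least 10×, and at most 20% missing genotypes) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the coding of diploid genotypes as alternate-allele counts (0, 1, 2, or `None` when missing), and the use of a single depth value per site are assumptions made only for illustration.

```python
from typing import List, Optional, Sequence


def select_samples(ancestry: Sequence[Sequence[float]],
                   min_ancestry: float = 0.80) -> List[int]:
    """Return indices of samples whose largest ancestry fraction exceeds min_ancestry.

    `ancestry` is assumed to hold one row per sample with K ancestry
    proportions, such as the rows of an ADMIXTURE Q matrix.
    """
    return [i for i, q in enumerate(ancestry) if max(q) > min_ancestry]


def passes_site_filters(genotypes: Sequence[Optional[int]],
                        depth: float,
                        min_maf: float = 0.05,
                        min_depth: float = 10.0,
                        max_missing: float = 0.20) -> bool:
    """Apply the missingness, depth, and MAF thresholds to one SNP.

    `genotypes` are hypothetical diploid calls coded as alternate-allele
    counts (0, 1, 2), with None for a missing call. `depth` is assumed to
    be a single summary depth for the site.
    """
    if not genotypes:
        return False
    called = [g for g in genotypes if g is not None]
    n_missing = len(genotypes) - len(called)
    if not called or n_missing / len(genotypes) > max_missing:
        return False
    if depth < min_depth:
        return False
    # Alternate-allele frequency over called diploid genotypes.
    alt_freq = sum(called) / (2 * len(called))
    maf = min(alt_freq, 1.0 - alt_freq)
    return maf >= min_maf


if __name__ == "__main__":
    q_matrix = [[0.90, 0.10], [0.50, 0.50], [0.15, 0.85]]
    print(select_samples(q_matrix))
    print(passes_site_filters([0, 1, 2, 1, None], depth=12.0))
```

In practice, filters of this kind are usually applied to VCF files with dedicated tools rather than hand-written code; the sketch only makes the stated thresholds concrete.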
