Patterns of population structure and genetic variation within the Saudi Arabian population

沙特阿拉伯人口的人口结构和遗传变异模式

阅读:1

Abstract

The Arabian Peninsula is considered the initial site of historic human migration out of Africa. The modern-day indigenous Arabians are believed to be the descendants who remained from the ancient split of the migrants into Eurasia. Here, we investigated how the population history and cultural practices such as endogamy have shaped the genetic variation of the Saudi Arabians. We genotyped 3,352 individuals and identified twelve genetic sub-clusters that corresponded to the geographical distribution of different tribal regions, differentiated by distinct components of ancestry based on comparisons to modern and ancient DNA references. These sub-clusters also showed variation across ranges of the genome covered in runs of homozygosity, as well as differences in population size changes over time. Using 25,488,981 variants found in whole genome sequencing data (WGS) from 302 individuals, we found that the Saudi tend to show proportionally more deleterious alleles than neutral alleles when compared to Africans/African Americans from gnomAD (e.g. a 13% increase of deleterious alleles annotated by AlphaMissense between 0.5 - 5% frequency in Saudi, compared to 7% decrease of the benign alleles; P < 0.001). Saudi sub-clusters with greater inbreeding and lower effective population sizes showed greater enrichment of deleterious alleles as well. Additionally, we found that approximately 10% of the variants discovered in our WGS data are not observed in gnomAD; these variants are also enriched with deleterious annotations. To accelerate studying the population-enriched deleterious alleles and their health consequences in this population, we made available the allele frequency estimates of 25,488,981 variants discovered in our samples. Taken together, our results suggest that Saudi's population history impacts its pattern of genetic variation with potential consequences to the population health. It further highlights the need to sequence diverse and unique populations so to provide a foundation on which to interpret medical- and pharmaco- genomic findings from these populations.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。