Strategies in Global Ancestry and Local Ancestry Inference

全球祖源和局部祖源推断策略

阅读:1

Abstract

Genetic ancestry inference has become essential in population and medical genetics, especially for studies of admixed populations. Accurate determination of both global ancestry (GA) proportions and local ancestry (LA) segmental origins requires careful selection of computational methods and reference panels. Here, we present a practical, protocol-oriented guide that (i) clarifies key concepts (GA vs. LA, reference panel selection, phasing requirements), (ii) organizes methods into model-based clustering and dimensionality-reduction approaches for GA and hidden Markov model-based, window-based machine learning, and deep learning frameworks for LA and (iii) provides concise guidance on tool selection for GA and LA. Step-by-step protocols are provided for a typical ADMIXTURE-based GA analysis and for a SHAPEIT5 + RFMix LA inference pipeline, with practical considerations for genotype array and whole-genome sequencing data. We also discuss quality control, method validation, and downstream applications of ancestry inference. Finally, we address current challenges and highlight recent advances, including fast algorithms, deep learning models, improved phasing, and integrative tools. This guide aims to help researchers select and implement appropriate ancestry inference methods for diverse study designs and datasets. © 2026 The Author(s). Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Global ancestry analysis (ADMIXTURE pipeline) Basic Protocol 2: Local ancestry analysis (phasing + RFMix pipeline).

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。