Rapid forensic ancestry inference in selected Northeast Asian populations: a Y-STR based attention-based ensemble framework for initial investigation guidance

针对特定东北亚人群的快速法医祖源推断:基于Y-STR的注意力集成框架在初步调查指导中的应用

阅读:2

Abstract

INTRODUCTION: Rapid inference of ancestral origin fromDNA evidence is critical in time-sensitive forensic investigations, particularly during the initial hours when crucial investigative decisions must be made. Although comprehensive analyses using multiple genetic markers provide thorough results, they often require significant processing time and resources. Y-chromosome short tandem repeats (Y-STRs) exhibit population-specific allelic distributions that facilitate rapid analysis, making them particularly valuable for initial screening in forensic contexts. METHODS: This study aims to enhance population classification accuracy using Y-STR profile analysis, with a particular focus on Northeast Asian populations that are often merged into a single group by commercial ancestry panels. We developed a machine learning architecture centered on an attention-based ensemble mechanism that incorporates three complementary algorithms: a One-vs-Rest Random Forest, XGBoost, and Logistic Regression, each configured to effectively manage imbalanced datasets. RESULTS: Utilizing only Y-STR data, the model achieved an overall accuracy of 80%-81% and demonstrated high stability. Notably, the model effectively processes imbalanced datasets, generating reliable outcomes for rapid ancestry assessment in time-critical investigations. DISCUSSION: By addressing a key limitation in commercial ancestry panels--their failure to differentiate among Northeast Asian subpopulations--this framework provides valuable preliminary guidance in forensic cases involving Asian individuals. Consequently, our approach enhances rapid screening capabilities, which can inform early-stage investigations while complementing subsequent, more comprehensive genetic analyses.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。