Examining the Performance of Polygenic Risk Scores for Alzheimer Disease Within and Across Populations Using k-Fold Cross-Validation

利用k折交叉验证检验多基因风险评分在人群内部和人群之间对阿尔茨海默病预测性能的影响

阅读:3

Abstract

BACKGROUND AND OBJECTIVES: Alzheimer disease (AD) has a complex etiology with a strong genetic component. Despite mounting evidence that genetic risk effect sizes vary by population, most research on the genetics of AD has examined only data sets of individuals with European ancestry. In this study, we investigate the variable performance and transferability of polygenic risk scores (PRSs) by deriving a PRS from analyses of AD for various race and ethnic categories and applying this across groups using a k-fold cross-validation approach. METHODS: After quality control and application of inclusion criteria, we analyzed 11,254 individuals from the Alzheimer's Disease Sequencing Project Release 3 with 3 predominant self-identified race/ethnicity groups: Hispanic individuals (n = 2,207), non-Hispanic Black individuals (n = 2,437), and non-Hispanic White individuals (n = 6,610). For each group, a 5-fold cross-validation approach was used to perform a genome-wide association study (training) and construct PRS estimates (test). This approach allowed for consideration of within-group and across-group PRS performance. Area under the curve (AUC) was calculated for each race/ethnicity-specific PRS within each cross-validation fold. Various PRS criteria were considered, including rare single-nucleotide polymorphisms and the APOE region in calculation, sex-specific PRSs, and different sample sizes of training and test groups, and their effects were examined through goodness-of-fit metrics in reduced and full models. RESULTS: Across all participant groups, the PRS trained in the same race/ethnicity category as the test group nearly exclusively outperformed the other PRSs with considerable benefit in the Hispanic group. The range of AUCs across iterations was also greater when applying PRSs from other groups. These findings were consistent after inclusion of APOE, sex, and age covariates and through different PRS generation criteria. DISCUSSION: In this study, we demonstrated the variable performance of AD PRSs within race/ethnicity groups and the weakness in transferability across groups, accounting for different criteria in PRS calculation. This work provided a framework for improving PRS application for AD and highlighted the importance of recruiting diverse populations in genetic studies.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。