Risk factors affecting polygenic score performance across diverse cohorts

影响多基因评分在不同人群中表现的风险因素

阅读:1

Abstract

Apart from ancestry, personal or environmental covariates may contribute to differences in polygenic score (PGS) performance. We analyzed effects of covariate stratification and interaction on body mass index (BMI) PGS (PGS(BMI)) across four cohorts of European (N=491,111) and African (N=21,612) ancestry. Stratifying on binary covariates and quintiles for continuous covariates, 18/62 covariates had significant and replicable R(2) differences among strata. Covariates with the largest differences included age, sex, blood lipids, physical activity, and alcohol consumption, with R(2) being nearly double between best and worst performing quintiles for certain covariates. 28 covariates had significant PGS(BMI)-covariate interaction effects, modifying PGS(BMI) effects by nearly 20% per standard deviation change. We observed overlap between covariates that had significant R(2) differences among strata and interaction effects - across all covariates, their main effects on BMI were correlated with their maximum R(2) differences and interaction effects (0.56 and 0.58, respectively), suggesting high-PGS(BMI) individuals have highest R(2) and increase in PGS effect. Using quantile regression, we show the effect of PGS(BMI) increases as BMI itself increases, and that these differences in effects are directly related to differences in R(2) when stratifying by different covariates. Given significant and replicable evidence for context-specific PGS(BMI) performance and effects, we investigated ways to increase model performance taking into account non-linear effects. Machine learning models (neural networks) increased relative model R(2) (mean 23%) across datasets. Finally, creating PGS(BMI) directly from GxAge GWAS effects increased relative R(2) by 7.8%. These results demonstrate that certain covariates, especially those most associated with BMI, significantly affect both PGS(BMI) performance and effects across diverse cohorts and ancestries, and we provide avenues to improve model performance that consider these effects.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。