Polygenic risk scores (PRS) are commonly used to quantify the inherited susceptibility for a trait, yet they fail to account for non-linear and interaction effects between single nucleotide polymorphisms (SNPs). We address this via a machine learning approach, validated in nine complex phenotypes in a multi-ancestry population. We use an ensemble method of SNP selection followed by gradient boosted trees (XGBoost) to allow for non-linearities and interaction effects. We compare our results to the standard, linear PRS model developed using PRSice, LDpred2, and lassosum2. Combining a PRS as a feature in an XGBoost model results in a relative increase in the percentage variance explained compared to the standard linear PRS model by 22% for height, 27% for HDL cholesterol, 43% for body mass index, 50% for sleep duration, 58% for systolic blood pressure, 64% for total cholesterol, 66% for triglycerides, 77% for LDL cholesterol, and 100% for diastolic blood pressure. Multi-ancestry trained models perform similarly to specific racial/ethnic group trained models and are consistently superior to the standard linear PRS models. This work demonstrates an effective method to account for non-linearities and interaction effects in genetics-based prediction models.
Non-linear machine learning models incorporating SNPs and PRS improve polygenic prediction in diverse human populations.
阅读:4
作者:Elgart Michael, Lyons Genevieve, Romero-Brufau Santiago, Kurniansyah Nuzulul, Brody Jennifer A, Guo Xiuqing, Lin Henry J, Raffield Laura, Gao Yan, Chen Han, de Vries Paul, Lloyd-Jones Donald M, Lange Leslie A, Peloso Gina M, Fornage Myriam, Rotter Jerome I, Rich Stephen S, Morrison Alanna C, Psaty Bruce M, Levy Daniel, Redline Susan, Sofer Tamar
| 期刊: | Communications Biology | 影响因子: | 5.100 |
| 时间: | 2022 | 起止号: | 2022 Aug 22; 5(1):856 |
| doi: | 10.1038/s42003-022-03812-z | ||
特别声明
1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。
2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。
3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。
4、投稿及合作请联系:info@biocloudy.com。
