A simple approach for multiple observations improves power to detect genetic effects and genomic prediction accuracy

针对多次观测,一种简便的方法可以提高检测遗传效应的能力和基因组预测的准确性。

阅读:1

Abstract

Many datasets, including widely used biobanks, have more than one observation of numerous phenotypes for at least a portion of their sample. The majority of genome-wide association studies (GWASs) utilize only a single observation per individual, even when more than one observation may be available, and apply a standard model in which the additive allelic effect being estimated is assumed to be constant across the age or time range in the sample. Here, we test a set of simple approaches to utilize multiple observations per individual, under this same assumption, to characterize effects on GWAS power, SNP heritability, gene set enrichment, and polygenic prediction. We find that utilizing the mean or median of the available observations rather than a single observation improves the power to detect associated loci and enriched gene sets and yields higher out-of-sample polygenic score prediction accuracy. Despite growing biobanks, many deeply phenotyped samples are relatively small but have multiple observations. While explicitly modeling age- or time-dependent genetic effects can add nuance to genetic studies and estimates, most GWASs apply a standard, additive-only model; a simple approach of using the mean or median can improve power by reducing "noise" in the phenotype, utilize standard, optimized software, and be particularly impactful for smaller samples, including samples of diverse genetic ancestry existing in widely used biobanks such as the UK Biobank and the Health and Retirement Study.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。