A unified meta-regression model identifies genes associated with epilepsy

统一的元回归模型识别与癫痫相关的基因

阅读:1

Abstract

Epilepsy is a highly heterogeneous disorder thought to have strong genetic components. However, identifying these risk factors using whole-exome sequencing studies requires very large sample sizes and good signal-to-noise ratio in order to assess the association between rare variants in any given gene and disease. We present an approach for predicting constraint in the human genome through application of a Hidden Markov Model (HMM) to whole exome sequencing (WES) data. Using the Regeneron Genetics Center Million Exome dataset and the AllofUs whole genome sequencing data, we predict the probability of observing no variants across the population for each position in the genome. We then incorporate the predictions with the "rejected substitutions" (RS) score from Genomic Evolutionary Rate Profiling (GERP), pathogenicity predictions from AlphaMissense (AM), and pLoF/Missense annotations from Epi25 into a model that detects epilepsy-associated genes. We identify a set of significant ( p < 3. 4 × 10 (-7) ) genes which did not meet exome-wide significance in previous studies: KCNQ2, SCN2A, STXBP1, CACNA1A, SLC6A1, DYRK1A, KCNB1, SATB1, PCDHAC2, SP4 , and RYR2 ,. Our models allow us to evaluate the contribution of constraint, protein structure based pathogenicity prediction from AM, and pLoFs jointly. We show that unifying these moderators into a single model allows us to both strengthen our evidence for genes with already-known links to epilepsy and also identify new genes with likely links to epilepsy.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。