Data-driven insights to inform splice-altering variant assessment

数据驱动的洞见可用于指导剪接改变变异体的评估

阅读:1

Abstract

Disease-causing genetic variants often disrupt mRNA splicing, an intricate process that is incompletely understood. Thus, accurate inference of which genetic variants will affect splicing and what their functional consequences will be is challenging, particularly for variants outside of the essential splice sites. Here, we describe a set of data-driven heuristics that inform the interpretation of human splice-altering variants (SAVs) based on the analysis of annotated exons, experimentally validated SAVs, and the currently understood principles of splicing biology. We defined requisite splicing criteria by examining around 202,000 canonical protein-coding exons and 19,000 experimentally validated splicing branchpoints. This analysis defined the sequence, spacing, and motif strength required for splicing, with 95.9% of the exons examined meeting these criteria. By considering over 12,000 experimentally validated variants from the SpliceVarDB, we defined a set of heuristics that inform the evaluation of putative SAVs. To ensure the applicability of each heuristic, only those supported by at least 10 experimentally validated variants were considered. This allowed us to establish a measure of spliceogenicity: the proportion of variants at a location (or motif site) that affected splicing in a given context. This study makes considerable advances toward bridging the gap between computational predictions and the biological process of splicing, offering an evidence-based approach to identifying SAVs and evaluating their impact. Our splicing heuristics enhance the current framework for genetic variant evaluation with a robust, detailed, and comprehensible analysis by adding valuable context over traditional binary prediction tools.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。