Gene-centered representation of coding and regulatory variation enables outcome prediction

以基因为中心的编码和调控变异表征能够预测结果。

阅读:2

Abstract

Integrating coding and regulatory variation into unified, interpretable representations remains a challenge in functional genomics. Current approaches either focus on common variants or analyze individual variants in isolation, missing the cumulative, cell-type-specific impact of both coding and noncoding variants on each gene. We present Volaria, a computational framework that integrates coding and regulatory genetic variation into unified, gene-centered representations for disease outcome prediction from whole-genome sequencing. Volaria leverages deep learning models to capture variant effects on cell-type-specific gene expression and integrates them with AI-predicted exonic variant pathogenicity to produce representations that capture the cumulative effect of genome-wide rare and common variation. Applied to whole genomes of individuals with rare glomerular diseases, Volaria predicts individual outcomes directly from germline sequence, demonstrating that structured, cell-type-aware representations capture predictive signals beyond population-based polygenic risk scores and unstructured representations. Importantly, the framework identifies context-specific biological mechanisms, providing interpretability that can be aligned with clinical measurements. By encoding genome-wide variation into compact and biologically grounded representations, Volaria provides a scalable foundation for genome interpretation and individualized outcome modeling from germline sequence, complementing phenotypic and clinical information in the future integrative frameworks.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。