Practical utility of sequence-to-omics models for improving the reproducibility of genetic fine-mapping

序列到组学模型在提高遗传精细定位可重复性方面的实际应用

阅读:4

Abstract

Recent advances in deep learning have led to the development of sequence-to-omics (S2O) models that predict molecular phenotypes directly from DNA sequences. Here, we systematically evaluate the utility of these models, e.g., AlphaGenome, Borzoi, Enformer, and Sei, for improving the reproducibility of genetic fine-mapping across expression quantitative trait loci (eQTL) datasets from Genotype-Tissue Expression (GTEx), Trans-Omics Precision Medicine (TOPMed), and Multi-Ancestry Analysis of Gene Expression (MAGE) projects. We show that purely statistical fine-mapping often yields high replication failure rates (RFRs), but integrating S2O model predictions substantially reduces RFRs and enhances the accuracy of prioritizing SNPs replicated in other consortia. We describe a generalized framework for functionally informed fine-mapping that combines traditional posterior inclusion probabilities (PIPs) from statistical fine-mapping methods with scores from S2O models to generate functionally informed PIPs (fiPIPs) that improve reproducibility. Our findings demonstrate that S2O models, particularly newer ones like AlphaGenome and Borzoi, enable robust identification of replicated variants across consortia, highlighting their promise for scalable, functionally aware genetic mapping.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。