Decoding the interconnected splicing patterns of hepatitis B virus and host using large language and deep learning models

利用大型语言和深度学习模型解码乙型肝炎病毒与宿主之间相互关联的剪接模式

阅读:3

Abstract

Hepatitis B virus (HBV) infection causes one million deaths annually and remains a major driver of hepatocellular carcinoma. Despite its compact 3.2 kb genome, HBV exhibits extensive alternative splicing. HBV splice variants contribute to immune evasion and reduce the likelihood of achieving a functional cure. Here, we show that HBV splicing efficiency - quantified from 279 RNA-sequencing libraries of HBV-associated liver biopsies and cultured cells - correlates more strongly with disease progression than the overall proportion of spliced HBV RNA, the latter of which has been proposed as an emerging biomarker. All HBV splice sites are embedded within protein-coding regions, forming a gene structure distinct from typical host splice sites. To decode the sequence determinants of HBV splicing, we apply SpliceBERT and OpenSpliceAI to 4,706 HBV genomes. These models reveal that HBV splice donor sites share features with host splice donor sites, whereas HBV splice acceptor sites are more cryptic. These patterns likely reflect constraints imposed by HBV's compact genome, which must accommodate overlapping protein-coding regions. Motif conservation and splicing propensity analyses across HBV genomes reveal context- and genotype-specific splicing patterns, indicating regulation by sequence context. HBV genotypes may have coevolved with their human hosts to exploit suboptimal but spliceable host-like motifs without disrupting their gene structure, supporting mechanisms of viral persistence and immune evasion. This study demonstrates the utility of artificial intelligence in decoding viral splicing patterns and provides a framework for investigating co-transcriptional processes in other clinically important viruses.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。