StringTie3 Improves Total RNA-seq Assembly by Resolving Nascent and Mature Transcripts

StringTie3 通过解析新生和成熟转录本来提高 RNA-seq 总组装质量

阅读:1

Abstract

Accurate assembly of rRNA-depleted (total) RNA-seq remains challenging because existing methods often conflate incomplete, nascent RNA with fully processed mature isoforms, leading to misassemblies and quantification errors that skew downstream analyses. Here, we present StringTie3, a major update to the widely used StringTie assembler, specifically designed for total RNA-seq. This new version introduces two key innovations: (1) a nascent mode that models co-transcriptional splicing to separate nascent from mature transcripts, and (2) a refined long-read module that distinguishes genuine polyadenylation sites from poly(A)-priming artifacts. Across short-, long-, and hybrid-read datasets, StringTie3 substantially reduces assembly errors and outperforms existing tools, boosting precision by up to 20% in short-read total RNA-seq and improving sensitivity and precision by as much as 37% and 75%, respectively, in long-read assemblies. In Argonaute knockout experiments, nascent-mode analysis shows that single knockouts predominantly alter nascent transcripts while leaving mature RNA largely unchanged, whereas double or triple knockouts disrupt both fractions. Applying this approach to breast cancer samples shows that, although nascent and mature RNA levels often correlate, certain extracellular matrix and tumor suppressor genes deviate from this pattern, suggesting post-transcriptional regulation. By accurately reconstructing transcriptomes and distinguishing nascent from mature RNA, StringTie3 reveals hidden layers of RNA regulation and provides a powerful framework for investigating transcriptional and post-transcriptional processes in total RNA-seq data.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。