ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data

ESPRESSO：从容易出错的长读 RNA 测序数据中稳健地发现和量化转录异构体

阅读：8

作者：Yuan Gao, Feng Wang, Robert Wang, Eric Kutschera, Yang Xu, Stephan Xie, Yuanyuan Wang, Kathryn E Kadash-Edmondson, Lan Lin, Yi Xing

期刊：

Science Advances

影响因子：

11.700

时间：

2023

起止号：

2023 Jan 20;9(3):eabq5072.

doi：

10.1126/sciadv.abq5072

研究方向：

信号转导

Abstract

Long-read RNA sequencing (RNA-seq) holds great potential for characterizing transcriptome variation and full-length transcript isoforms, but the relatively high error rate of current long-read sequencing platforms poses a major challenge. We present ESPRESSO, a computational tool for robust discovery and quantification of transcript isoforms from error-prone long reads. ESPRESSO jointly considers alignments of all long reads aligned to a gene and uses error profiles of individual reads to improve the identification of splice junctions and the discovery of their corresponding transcript isoforms. On both a synthetic spike-in RNA sample and human RNA samples, ESPRESSO outperforms multiple contemporary tools in not only transcript isoform discovery but also transcript isoform quantification. In total, we generated and analyzed ~1.1 billion nanopore RNA-seq reads covering 30 human tissue samples and three human cell lines. ESPRESSO and its companion dataset provide a useful resource for studying the RNA repertoire of eukaryotic transcriptomes.

ESPRESSO: Robust discovery and quantification of transcript isoforms from error-prone long-read RNA-seq data

ESPRESSO：从容易出错的长读 RNA 测序数据中稳健地发现和量化转录异构体

Abstract

特别声明