QoALa: A comprehensive workflow for viral quasispecies diversity comparison using long-read sequencing data

QoALa:一种利用长读长测序数据进行病毒准种多样性比较的综合工作流程

阅读:1

Abstract

The concept of viral quasispecies refers to a constantly mutating viral population occurring within hosts, which is essential for grasping the micro-evolutionary patterns of viruses. Despite its high error rate, long-read sequencing holds potential for advancing viral quasispecies research by resolving coverage limitations in next-generation sequencing. We introduce a refined workflow, QoALa, implemented in the longreadvqs R package. This workflow begins with nucleotide position-wise noise minimization of read alignments and sample size standardization, and extends to viral quasispecies comparison across related samples with integrated visualization capabilities. Benchmarking on simulated SARS-CoV-2 and HIV-1 datasets demonstrated that QoALa consistently outperformed existing error-correction methods in recovering quasispecies composition, particularly in preserving nucleotide diversity and hierarchical population structure. Real raw read samples from five studies of different viruses (HCV, HBV, HIV-1, SARS-CoV-2, and IAV), sequenced by major long-read platforms, were also used to evaluate these approaches. The comparative results provide novel insights into intra- and inter-host diversity dynamics in various scenarios and unveil rare haplotypes not reported in the original studies, underscoring the versatility and practicality of our methodology.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。