Reconstructing SARS-CoV-2 lineages from mixed wastewater sequencing data

从混合废水测序数据重建 SARS-CoV-2 谱系

阅读:12

Abstract

Wastewater surveillance of SARS-CoV-2 has emerged as a critical tool for tracking the spread of COVID-19. In addition to estimating the relative case numbers using quantitative PCR, SARS-CoV-2 genomic RNA can be extracted from wastewater and sequenced. There are many existing techniques for using the sequenced RNA to determine the relative abundance of known lineages in a sample. However, it is very challenging to predict novel lineages from wastewater data due to its mixed composition and unreliable genomic coverage. In this work, we present a novel technique based on non-negative matrix factorization which is able to reconstruct lineage definitions by analyzing data from across different samples. We test the method both on synthetic and real wastewater sequencing data. We show that the technique is able to determine major lineages such as Omicron and Delta as well as sub-lineages such as BA.5.2.1. We provide a method for determining emerging lineages in wastewater without the need for genomic data from clinical samples. This could be used for routine monitoring of SARS-CoV-2 as well as other emerging viral pathogens in wastewater. Additionally, it may be used to determine more full-genome sequences for viruses with fewer available genomes.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。