A70 Viromes of two migratory birds as support for identifying the influence of molecular biology and bioinformatics settings

以两种候鸟的A70病毒组为研究对象,探讨分子生物学和生物信息学设置的影响

阅读:2

Abstract

Thanks to high-throughput sequencing (HTS) and dedicated bioinformatics tools, it is possible to study all viruses present in a sample with an unprecedented detection of viral diversity. Applied to wildlife and humans, HTS becomes a powerful means to understand viral emerging infectious diseases. BLAST searches to find remote homolog sequences after an assembly of shotgun-generated sequences are often used to analyze virome datasets. However, this process might be biased by molecular biology approaches (use of whole genome/transcriptome amplification) and bioinformatics assembly that can induce chimeric sequences formation. Our objective was to identify confounding factors (molecular biology and bioinformatics) that can bias virome composition. Hence, we studied the viromes from cloacal swab samples of two migratory birds. Viral DNA and RNA were extracted using three different methods (easymag(®), phenol, and trizol), followed by whole genome amplification (WGA)/WTA amplifications, before Illumina MiSeq sequencing. After trimming and assembly, sequences underwent BLAST (BLASTn and BLASTx). At the end of the first BLAST (n/x), some portions of sequences were uncovered and unassigned. We, then, implemented a novel recursive split-resubmit python program that searched for homologs of > 50 base pairs (bp), uncovering parts for a better exploitation of the datasets. Thirty-eight known viral families were detected in our samples. Circoviridae, Parvoviridae, and Microviridae were found in the majority of the results from easymag(®) and Phenol extracted samples, while trizol samples resulted in a majority of Picornaviridae and Coronaviridae. These results show that the viromes are of different composition depending on extraction methods. Most of the viral sequences were predominantly identified at the blastx step reflecting a high level of divergence with known viral sequences. Compared to a unique BLAST search, our resubmission pipeline allowed the assignment of up to 23 per cent of the viral sequences. Most of these sequences belonged to taxa already observed at the first blast step and few viral sequences of different families were detected. Knowing the complementarities between extraction methods and a better identification of viral sequences using our pipeline, the next step will be to get a precise position of each viral genomic segment in phylogenetic distance-based tree that will give a more accurate representation of viral diversity.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。